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This Application is a Continuation-in-Part of 
USSN 09/575,1 68 Filed 23 May 2000 

ORIENTATION SENSING DEVICE 

FIELD OF INVENTION 

5 The present invention relates generally to methods, systems and apparatus for 

interacting with computers. The invention relates particularly to a sensing device for 
sensing its own orientation relative to a surface when moved or positioned relative to the 
surface. 

The invention has been developed primarily to allow a large number of 
1 0 distributed users to interact with networked information via printed matter and optical 
sensors, thereby to obtain interactive printed matter on demand via high-speed 
networked color printers. Although the invention will largely be described herein with 
reference to this use, it will be appreciated that the invention is not limited to use in this 
field. 



15 CO-PENDING APPLICATIONS 

A*arious methods, systems and apparatus relating to the present invention are 

discloseam W following co-pending applications filed by the applicant or assignee of 
the present invention simultaneously with the present invention: 

NPA060US, NPA661US, NPA081US, NPA082US, NPP010US, NPP013US, 
20 NPP015US, NPP020US, NPP021US, NPP022US, NPP023US, NPS014US, 
NPS015US, NPS017US, NPS018US, NPS022US, NPS027US NPS028US, 
NPT008US, BINOIUSA BIN02US, BIN03US, BIN04US 

The disclosures of these co-pending applications are incorporated herein by 
25 cross-reference. Each applicationNis temporarily identified by its docket number. This 
will be replaced by the correspondingJJSSN when available. 

Various methods, systems anchapparatus relating to the present invention are 
disclosed in the following co-pending applications filed by the applicant or assignee of 
the present invention on 20 October 2000: \ 

30 NPA011US, NPA031US, NPA040US, NPA0^6US, NPA053US, NPA059US, 
NPA064US, NPB006US, NPS004US, NPS008WS, NPS013US, NPS024US, 
UP01US, UP02US, UP03US, UP04US,\ UP05US 
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le disclosures of these co-pending applications are incorporated herein by 
cross-referenoe. Each application is temporarily identified by its docket number. This 
will be replaced^by the corresponding USSN when available. 

Various methods, systems and apparatus relating to the present invention are 
disclosed in the following co-pending applications filed by the applicant or assignee of 
the present invention on\15 September 2000: 

NPA024US, NPA025U9y NPA047US, NPA049US 

The disclosures ofNthese co-pending applications are incorporated herein by 
cross-reference. Each application is temporarily identified by its docket number. This 
will be replaced by the corresponding USSN when available. 

Various methods, system^ and apparatus relating to the present invention are 
disclosed in the following co-pending applications filed by the applicant or assignee of 
the present invention on 30 June 2000: 

NPA014US, NPA015US, NPA022US\ NPA026US, NPA038US, NPA041US, 
NPA050US, NPA051US, NPA052US, \NPA063US, NPA065US, NPA067US, 
NPA068US, NPA069US, NPA071US, MPA072US, NPB003US, NPB004US, 
NPB005US, NPP019US, PEC04US, PfeC05US, PEC06US, PEC07US 

The disclosures of these co-pending applications are incorporated herein by 
cross-reference. Each application is temporarily identified by its docket number. This 
will be replaced by. the corresponding USSN when available. 

Various methods, systems and apparatus relating to the present invention are 
25 disclosed in the following co-pending applications filed \y the applicant or assignee of 
the present invention on 23 May 2000: 



15 



20 



NPA001US, 
NPA008US, 
NPA018US, 

30 NPA048US, 
NPK004US, 
NPN001US, 
NPP008US, 
NPS020US, 

35 NPX003US, 
IJM52US, 
MJ15US, 



NPA002US, 

NPA009US, 

NPA019US, 

NPA075US, 

NPK005US, 

NPP001US, 

NPP016US, 

NPT001US, 

NPX008US, 

MJ10US, 

MJ34US, 



NPA004US, 

NPA010US, 

NPA020US, 

NPB001US, 

NPM001US, 

NPP003US, 

NPP017US, 

NPT002US, 

NPX01 1US, 

MJ11US, 

MJ47US, 



NPA005US, 

NPA012US, 

NPA021US, 

NPB002US, 

NPM002US, 

NPP005US, 

NPP018US, 

NPT003US, 

NPX014US, 

MJ12US, 

MJ58US, 



NRA006US, 
NPA016US, 
NPA030US, 
NPK002US, 
NPM003US, 
NPP006CIS, 
NPS001US 
NPT004US,\ 
NPX016US, 
MJ13US, 
MJ62US, 



NPA007US, 
NPA017US, 
NPA035US, 
NPK003US, 
NPM004US, 
NPP007US, 
NPS003US, 
NPX001US, 
JJ52US, 
1J14US, 
1J63US, 
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PAK04US, PAK05US, PAK06US, PAK07US, PAK08US, PECO 1 US, 
PEC02US, PEC03US\ 

The disclosures oh these co-pending applications are incorporated herein by 
5 cross-reference. Each application is temporarily identified by its docket number. This 
will be replaced by the corresponding USSN when available. 
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BACKGROUND 

Presently, a user of a computer system typically interacts with the system using 
a monitor for displaying information and a keyboard and/or mouse for inputting 
information. Whilst such an interface is powerful, it is relatively bulky and non-portable. 
5 Information printed on paper can be easier to read and more portable than information 
displayed on a computer monitor. However, unlike a keyboard or mouse, a pen on paper 
generally lacks the ability to interact with computer software. 

OBJECT 

It is an object of the present invention to combine advantages of pen on paper 
1 0 and computer systems. 

SUMMARY OF INVENTION 

The present invention relates to a sensing device for generating orientation data 

when positioned or moved relative to a surface, the orientation data being indicative of 
an orientation of the sensing device relative to the surface, the surface having coded data 
1 5 disposed upon it, the coded data being indicative, when sensed by the sensing device, of 
the orientation, the sensing device including: 

a housing; 

orientation sensing means configured to generate the orientation data using at 
least some of the coded data; and 

20 communications means configured to communicate the orientation data to a 

computer system. 

In a preferred embodiment the orientation data is indicative of the yaw, pitch 
and/or roll of the housing relative to the surface. 

Preferably, the sensing device includes motion sensing means for generating 
25 movement data when the sensing device is moved relative to the surface, the 
communications means being configured to communicate the movement data to the 
computer system. 

Preferably also, the sensing device includes region identity sensing means 
configured to sense, when the sensing device is positioned or moved relative to a region 
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of the surface, and using at least some of the coded data, region identity data indicative 
of an identity of the region, the communications means being configured to 
communicate the region identity data to the computer system. 

The orientation sensing means preferably detects the orientation of the housing 
5 relative to the surface dynamically as the housing is moved. The housing may have an 
elongate shape which can be held by a user. In one embodiment, the housing has the 
shape of a pen. The housing may be provided with a marking nib for marking the 
surface, but this is not essential. 

By simultaneously capturing orientation and movement data the system may be 
10 used to verify a person's signature. Alternatively, dynamically-measured orientation 
signals can enable the housing to be used as a joystick. For example, such a joystick 
could be used with three-dimensional software applications. Note that it is not essential 
for the orientation sensing means to sense the orientation of the housing in all three 
dimensions. It may be sufficient to detect only the pitc, as some applications may not 
1 5 need three-dimensional orientation information. For example, the housing may be used 
to linearly control an aspect of a device, such as the intensity of a light or the volume of a 
loudspeaker, by varying the pitch between 0° and 90°. 

The roll, pitch and yaw may be calculated by detecting perspective distortion 
and rotation of the coded data. 

20 Firstly, it can be used to determine when the apparatus is first applied to the 

surface and when it leaves the surface, with motion between force application and 
removal being defined as a 'stroke' in freehand. The force data information can be time 
stamped. 

The apparatus is preferably a separate implement containing the appropriate 
25 means as discussed above. It may be any shape but it is preferably in the form of a stylus 
or pen. 

Preferably, the apparatus incorporates a marking nib for marking the surface 
with hand-drawn information, but this is not essential. 

The apparatus is preferably intended for interaction with a computer system 
30 that can be controlled and can interpret hand-drawn information (whether drawing or 
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writing) applied by a user via the device. Preferably, the sensing device is arranged to 
provide device identification information which uniquely identifies the device. The 
computer system may therefore use this to identify the device. 

Features and advantages of the present invention will become apparent from 
the following description of embodiments thereof, by way of example only, with 
reference to the accompanying drawings. 
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BRIEF DESCRIPTION OF DRAWINGS 

Preferred and other embodiments of the invention will now be described, by 

way of non-limiting example only, with reference to the accompanying drawings, in 
which: 

5 Figure 1 is a schematic of a the relationship between a sample printed netpage and its 
online page description; 

Figure 2 is a schematic view of a interaction between a netpage pen, a netpage printer, a 
netpage page server, and a netpage application server; 

Figure 3 illustrates a collection of netpage servers and printers interconnected via a 
1 0 network; 

Figure 4 is a schematic view of a high-level structure of a printed netpage and its online 
page description; 

Figure 5 a is a plan view showing a structure of a netpage tag; 

Figure 5b is a plan view showing a relationship between a set of the tags shown in Figure 
15 5a and a field of view of a netpage sensing device in the form of a netpage pen; 

Figure 6a is a plan view showing an alternative structure of a netpage tag; 

Figure 6b is a plan view showing a relationship between a set of the tags shown in Figure 
6a and a field of view of a netpage sensing device in the form of a netpage pen; 

Figure 6c is a plan view showing an arrangement of nine of the tags shown in Figure 6a 
20 where targets are shared between adjacent tags; 

Figure 6d is a plan view showing the interleaving and rotation of the symbols of the four 
codewords of the tag shown in Figure 6a; 

Figure 7 is a flowchart of a tag image processing and decoding algorithm; 

Figure 8 is a perspective view of a netpage pen and its associated tag-sensing field-of- 
25 view cone; 

Figure 9 is a perspective exploded view of the netpage pen shown in Figure 8; 

Figure 10 is a schematic block diagram of a pen controller for the netpage pen shown in 
Figures 8 and 9; 
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Figure 1 1 is a perspective view of a wall-mounted netpage printer; 

Figure 12 is a section through the length of the netpage printer of Figure 1 1 ; 

Figure 12a is an enlarged portion of Figure 12 showing a section of the duplexed print 
engines and glue wheel assembly; 

5 Figure 13 is a detailed view of the ink cartridge, ink, air and glue paths, and print engines 
of the netpage printer of Figures 1 1 and 12; 

Figure 14 is a schematic block diagram of a printer controller for the netpage printer 
shown in Figures 1 1 and 12; 

Figure 15 is a schematic block diagram of duplexed print engine controllers and 
1 0 Memjet™ printheads associated with the printer controller shown in Figure 14; 

Figure 16 is a schematic block diagram of the print engine controller shown in Figures 
14 and 15; 

Figure 17 is a perspective view of a single Memjet™ printing element, as used in, for 
example, the netpage printer of Figures 10 to 12; 

15 Figure 18 is a perspective view of a small part of an array of Memjet™ printing 
. elements; 

Figure 19 is a series of perspective views illustrating the operating cycle of the Memjet™ 
printing element shown in Figure 13; 

Figure 20 is a perspective view of a short segment of a pagewidth Memjet™ printhead; 
20 Figure 21 is a schematic view of a user class diagram; 

Figure 22 is a schematic view of a printer class diagram; 

Figure 23 is a schematic view of a pen class diagram; 

Figure 24 is a schematic view of an application class diagram; 

Figure 25 is a schematic view of a document and page description class diagram; 
25 Figure 26 is a schematic view of a document and page ownership class diagram; 

Figure 27 is a schematic view of a terminal element specialization class diagram; 

Figure 28 is a schematic view of a static element specialization class diagram; 
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Figure 29 is a schematic view of a hyperlink element class diagram; 

Figure 30 is a schematic view of a hyperlink element specialization class diagram; 

Figure 3 1 is a schematic view of a hyperlinked group class diagram; 

Figure 32 is a schematic view of a form class diagram; 
5 Figure 33 is a schematic view of a digital ink class diagram; 

Figure 34 is a schematic view of a field element specialization class diagram; 

Figure 35 is a schematic view of a checkbox field class diagram; 

Figure 36 is a schematic view of a text field class diagram; 

Figure 37 is a schematic view of a signature field class diagram; 
1 0 Figure 38 is a flowchart of an input processing algorithm; 

Figure 38a is a detailed flowchart of one step of the flowchart of Figure 38; 

Figure 39 is a schematic view of a page server command element class diagram; 

Figure 40 is a schematic view of a resource description class diagram; 

Figure 41 is a schematic view of a favorites list class diagram; 
1 5 Figure 42 is a schematic view of a history list class diagram; 

Figure 43 is a schematic view of a subscription delivery protocol; 

Figure 44 is a schematic view of a hyperlink request class diagram; 

Figure 45 is a schematic view of a hyperlink activation protocol; 

Figure 46 is a schematic view of a form submission protocol; 
20 Figure 47 is a schematic view of a commission payment protocol; 

Figure 48 is a schematic view of a set of radial wedges making up a symbol; 

Figure 49 is a schematic view of a ring A and B symbol allocation scheme; 

Figure 50 is a schematic view of a first ring C and D symbol allocation scheme; 

Figure 5 1 is a schematic view of a second ring C and D symbol allocation scheme; 
25 Figure 52 is a schematic view of a triangular tag packing; 
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Figure 53 is a perspective view of an icosahedron; 

Figure 54 is a perspective view of an icosahedral geodesic with frequency 3; 
Figure 55 is a schematic view of a minimum tag spacing; 
Figure 56 is a schematic view of a minimum tag spacing which avoids overlap; 
5 Figure 57 is a schematic view of a first tag insertion case; 
Figure 58 is a schematic view of a second tag insertion case; 
Figure 59 is a schematic view of a third tag insertion case; 
Figure 60 is a schematic view of a fourth tag insertion case; 
Figure 61 is a schematic view of a pen orientation relative to a surface; 
1 0 Figure 62 is a schematic view of a pen pitch geometry; 
Figure 63 is a schematic view of a pen roll geometry; 

Figure 64 is a schematic view of a pen coordinate space showing physical and optical 
axes of a pen; 

Figure 65 is a schematic view of a curved nib geometry; 

15 Figure 66 is a schematic view of an interaction between sampling frequency and tag 
frequency; 

Figure 67 is a schematic view of a pen optical path; 

Figure 68 is a flowchart of a stroke capture algorithm; and 

Figure 69 is a schematic view of a raw digital ink class diagram. 
20 Figure 70 is a table containing equations numbered 1 to 10; 

Figure 71 is a table containing equations numbered 1 1 to 20; 

Figure 72 is a table containing equations numbered 21 to 26; 

Figure 73 is a table containing equations numbered 27 to 34; 

Figure 74 is a table containing equations numbered 35 to 41; 
25 Figure 75 is a table containing equations numbered 42 to 44; 

Figure 76 is a table containing equations numbered 45 to 47; 
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Figure 88 is a table containing a set of degenerate forms of equations 64 to 71 ; 

Figure 89 is a first part of a table containing conditions and special handling for zero 
pitch and zero roll; and 

15 Figure 90 is the second part of the table of Figure 89. 
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DETAILED DESCRIPTION OF PREFERRED AND OTHER EMBODIMENTS 

Note: Memjet™ is a trade mark of Silverbrook Research Pty Ltd, Australia. 

In the preferred embodiment, the invention is configured to work with the 
netpage networked computer system, a detailed overview of which follows. It will be 
5 appreciated that not every implementation will necessarily embody all or even most of 
the specific details and extensions discussed below in relation to the basic system. 
However, the system is described in its most complete form to reduce the need for 
external reference when attempting to understand the context in which the preferred 
embodiments and aspects of the present invention operate. 

^ 10 In brief summary, the preferred form of the netpage system employs a computer 

vl! interface in the form of a mapped surface, that is, a physical surface which contains 

fn references to a map of the surface maintained in a computer system. The map references 

,!:^ can be queried by an appropriate sensing device. Depending upon the specific 

^3 implementation, the map references may be encoded visibly or invisibly, and defined in 

15 such a way that a local query on the mapped surface yields an unambiguous map 

JT reference both within the map and among different maps. The computer system can 

f y contain information about features on the mapped surface, and such information can be 

O retrieved based on map references supplied by a sensing device used with the mapped 

" rf surface. The information thus retrieved can take the form of actions which are initiated 

20 by the computer system on behalf of the operator in response to the operator's interaction 
with the surface features. 

In its preferred form, the netpage system relies on the production of, and human 
interaction with, netpages. These are pages of text, graphics and images printed on 
ordinary paper, but which work like interactive web pages. Information is encoded on 

25 each page using ink which is substantially invisible to the unaided human eye. The ink, 
however, and thereby the coded data, can be sensed by an optically imaging pen and 
transmitted to the netpage system. Substrates other than paper may be used. The encoded 
information in the preferred embodiment is an infrared absorptive ink and so an infrared 
sensitive optical sensor may be used. If desired other wavelengths may be used or 

30 sensing techniques other than optical sensing; one alternative is to use magnetic inks and 
sensors. 
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In the preferred form, active buttons and hyperlinks on each page can be 
clicked with the pen to request information from the network or to signal preferences to a 
network server. In one embodiment, text written by hand on a netpage is automatically 
recognized and converted to computer text in the netpage system, allowing forms to be 
5 filled in. In other embodiments, signatures recorded on a netpage are automatically 
verified, allowing e-commerce transactions to be securely authorized. 

As illustrated in Figure 1 , a printed netpage 1 can represent a interactive form 
which can be filled in by the user both physically, on the printed page, and 
"electronically", via communication between the pen and the netpage system. The 
10 example shows a "Request" form containing name and address fields and a submit 
O button. The netpage consists of graphic data 2 printed using visible ink, and coded data 3 

= i printed as a collection of tags 4 using invisible ink. The corresponding page description 

■Vf £ 5, stored on the netpage network, describes the individual elements of the netpage. In 

0 particular it describes the type and spatial extent (zone) of each interactive element (i.e. 
% q 1 5 text field or button in the example), to allow the netpage system to correctly interpret 
s ^ input via the netpage. The submit button 6, for example, has a zone 7 which corresponds 

1 ^ to the spatial extent of the corresponding graphic 8. 

J j As illustrated in Figure 2, the netpage pen 101, a preferred form of which is 

O shown in Figures 8 and 9 and described in more detail below, works in conjunction with 

20 a netpage printer 601, an Internet-connected printing appliance for home, office or 
mobile use. The pen is wireless and communicates securely with the netpage printer via 
a short-range radio link 9. If desired the pen may be connected to the system utilizing 
wires or an infrared transmitter, although both alternatives limit usability. 

The netpage printer 601, a preferred form of which is shown in Figures 11 to 
25 13 and described in more detail below, is able to deliver, periodically or on demand, 
personalized newspapers, magazines, catalogs, brochures and other publications, all 
printed at high quality as interactive netpages. Unlike a personal computer, the netpage 
printer is an appliance which can be, for example, wall-mounted adjacent to an area 
where the morning news is first consumed, such as in a user's kitchen, near a breakfast 
30 table, or near the household's point of departure for the day. It also comes in tabletop, 
desktop, portable and miniature versions. 
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Netpages printed at their point of consumption combine the ease-of-use of 
paper with the timeliness and interactivity of an interactive medium. 

As shown in Figure 2, the netpage pen 101 interacts with the coded data on a 
printed netpage 1 and communicates, via a short-range radio link 9, the interaction to a 
5 netpage printer. The printer 601 sends the interaction to the relevant netpage page server 
10 for interpretation. In appropriate circumstances, the page server sends a corresponding 
message to application computer software running on a netpage application server 13. 
The application server may in turn send a response which is printed on the originating 
printer. 

10 The netpage system is made considerably more convenient in the preferred 

embodiment by being used in conjunction with high-speed microelectromechanical 
system (MEMS) based inkjet (Memjet™) printers. In the preferred form of this 
technology, relatively high-speed and high-quality printing is made more affordable to 
consumers. In its preferred form, a netpage publication has the physical characteristics of 

15 a traditional newsmagazine, such as a set of letter-size glossy pages printed in full color 
on both sides, bound together for easy navigation and comfortable handling. 

The netpage printer exploits the growing availability of broadband Internet 
access. Cable service is available to 95% of households in the United States, and cable 
modem service offering broadband Internet access is already available to 20% of these. 

20 The netpage printer can also operate with slower connections, but either with longer 
delivery times or lower image quality or both. Indeed, the netpage system can be 
enabled using existing consumer inkjet and laser printers, although the system will 
operate more slowly and will therefore be less acceptable from a consumer's point of 
view. In other embodiments, the netpage system is hosted on a private intranet. In still 

25 other embodiments, the netpage system is hosted on a single computer or computer- 
enabled device, such as a printer. 

Netpage publication servers 14 on the netpage network are configured to 
deliver print-quality publications to netpage printers. Periodical publications are 
delivered automatically to subscribing netpage printers via pointcasting and multicasting 
30 Internet protocols. Personalized publications are filtered and formatted according to 
individual user profiles. 
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A netpage printer can be configured to support any number of pens, and a pen 
can work with any number of netpage printers. In the preferred implementation, each 
netpage pen has a unique identifier. A household may have a collection of colored 
netpage pens, one assigned to each member of the family. This allows each user to 
5 maintain a distinct profile with respect to a netpage publication server or application 
server, assuming that the assigned pen is only used by the respective family member. 
However, as explained below, other means may be used to identify a user. 

A netpage pen can also be registered with a netpage registration server 1 1 and 
linked to one or more payment card accounts. This allows e-commerce payments to be 
1 0 securely authorized using the netpage pen. The netpage registration server compares the 

*3 signature captured by the netpage pen with a previously registered signature, allowing it 

w 

J to authenticate the user's identity to an e-commerce server. Other biometrics can also be 

ijl used to verify identity. A version of the netpage pen includes fingerprint scanning, 

'I : ^ verified in a similar way by the netpage registration server. 

■ y 15 Although a netpage printer may deliver periodicals such as the morning 

!■■* newspaper without user intervention, it can be configured never to deliver unsolicited 

■jry junk mail. In its preferred form, it only delivers periodicals from subscribed or otherwise 

■;LJ authorized sources. In this respect, the netpage printer is unlike a fax machine or e-mail 

O account which is visible to any junk mailer who knows the telephone number or email 

20 address. Alternatively the entire system may be made visible to outside users or each 
user may be provided with the ability to expose their printer(s) to outside users. This may 
be by way of selecting outside users allowed too send junk mail. 

1 . Netpage System Architecture 

Each object model in the system is described using a Unified Modeling 

25 Language (UML) class diagram. A class diagram consists of a set of object classes 
connected by relationships, and two kinds of relationships are of interest here: 
associations and generalizations. An association represents some kind of relationship 
between objects, i.e. between instances of classes. A generalization relates actual classes, 
and can be understood in the following way: if a class is thought of as the set of all 

30 objects of that class, and class A is a generalization of class B, then B is simply a subset 
of A. 
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Each class is drawn as a rectangle labeled with the name of the class. It 
contains a list of the attributes of the class, separated from the name by a horizontal line, 
and a list of the operations of the class, separated from the attribute list by a horizontal 
line. In the class diagrams which follow, however, operations are never modeled. 

5 An association is drawn as a line joining two classes, optionally labeled at 

either end with the multiplicity of the association. The default multiplicity is one. An 
asterisk (*) indicates a multiplicity of "many", i.e. zero or more. Each association is 
optionally labeled with its name, and is also optionally labeled at either end with the role 
of the corresponding class. An open diamond indicates an aggregation association ("is- 
1 0 part-of '), and is drawn at the aggregator end of the association line. 

A generalization relationship ("is-a") is drawn as a solid line joining two 
classes, with an arrow (in the form of an open triangle) at the generalization end. 

When a class diagram is broken up into multiple diagrams, any class which is 
duplicated is shown with a dashed outline in all but the main diagram which defines it. It 
15 is shown with attributes only where it is defined. 

1.1 Netpages 

Netpages are the foundation on which a netpage network is built. They provide 

a paper-based user interface to published information and interactive services. 

A netpage consists of a printed page (or other surface region) invisibly tagged 
20 with references to an online description of the page. The tags may be printed on or into 
the surface of the page, may be in or on a sub-layer of the page or may be otherwise 
incorporated into the page. The online page description is maintained persistently by a 
netpage page server. The page description describes the visible layout and content of the 
page, including text, graphics and images. It also describes the input elements on the 
25 page, including buttons, hyperlinks, and input fields. The page descriptions of different 
netpages may share components, such as an image, although the netpages (and the 
associated page descriptions) are visibly different. The page description for each netpage 
may include references to these common components. A netpage allows markings made 
with a netpage pen on its surface to be simultaneously captured and processed by the 
30 netpage system. 
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Multiple netpages can share the same page description. However, to allow 
input through otherwise identical pages to be distinguished, each netpage is assigned a 
unique page identifier. This page ID has sufficient precision to distinguish between all 
netpages envisaged to be used in the environment of use. If the environment is small 
5 then the precision need not be as great as where the environment is large. 

Each reference to the page description is encoded in a printed tag. The tag 
identifies the unique page on which it appears, and thereby indirectly identifies the page 
description. In the preferred embodiments the tag also identifies its own position on the 
page. Characteristics of the tags are described in more detail below. 

1 0 Tags are printed in infrared-absorptive ink on any substrate which is infrared- 

l : t reflective, such as ordinary paper. Near-infrared wavelengths are invisible to the human 

^ eye but are easily sensed by a solid-state image sensor with an appropriate filter. A 

1 L| sensor sensitive to the relative wavelength or wavelengths may be used, in which case no 

£q filters are required. Other wavelengths may be used, with appropriate substrates and 

^ 15 sensors. 

A tag is sensed by an area image sensor in the netpage pen, decoded and the 
I U data encoded by the tag is transmitted to the netpage system, preferably via the nearest 

f i netpage printer. The pen is wireless and communicates with the netpage printer via a 

" w short-range radio link. Tags are sufficiently small and densely arranged that the pen can 

20 reliably image at least one tag even on a single click on the page. It is important that the 
pen recognize the tag and extract the page ID and position on every interaction with the 
page, since the interaction is stateless. Tags are error-correctably encoded to make them 
partially tolerant to surface damage. 

The netpage page server maintains a unique page instance for each printed 
25 netpage, allowing it to maintain a distinct set of user-supplied values for input fields in 
the page description for each printed netpage. 

The relationship between the page description, the page instance, and the 
printed netpage is shown in Figure 4. In the preferred embodiment the page instance is 
associated with both the netpage printer which printed it and, if known, the netpage user 
30 who requested it. It is not essential to the working of the invention in its basic form that 
the page instance be associated with either the netpage printer which printed the 
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corresponding physical page or the netpage user who requested it or for whom the page 
was printed. 

1.2 Netpage Tags 

1.2.1 Tag Data Content 

5 In a preferred form, each tag identifies the region in which it appears, and the 

location of that tag within the region. A tag may also contain flags which relate to the 
region as a whole or to the tag. One or more flag bits may, for example, signal a tag 
sensing device to provide feedback indicative of a function associated with the 
immediate area of the tag, without the sensing device having to refer to a description of 
1 0 the region. A netpage pen may, for example, illuminate an "active area" LED when in 
the zone of a hyperlink. 

As will be more clearly explained below, in a preferred embodiment, each tag 
contains an easily recognized invariant structure which aids initial detection, and which 
assists in minimizing the effect of any warp induced by the surface or by the sensing 
1 5 process. The tags preferably tile the entire page, and are sufficiently small and densely 
arranged that the pen can reliably image at least one tag even on a single click on the 
page. It is important that the pen recognize the page ID and position on every interaction 
with the page, since the interaction is stateless. 

In a preferred embodiment, the region to which a tag refers coincides with an 
20 entire page, and the region ID encoded in the tag is therefore synonymous with the page 
ID of the page on which the tag appears. In other embodiments, the region to which a tag 
refers can be an arbitrary subregion of a page or other surface. For example, it can 
coincide with the zone of an interactive element, in which case the region ID can directly 
identify the interactive element. 

25 Each tag typically contains 16 bits of tag ID, at least 90 bits of region ID, and a 

number of flag bits. Assuming a maximum tag density of 64 per square inch, a 16-bit tag 
ID supports a region size of up to 1024 square inches. Larger regions can be mapped 
continuously without increasing the tag ID precision simply by using abutting regions 
and maps. The distinction between a region ID and a tag ID is mostly one of 

30 convenience. For most purposes the concatenation of the two can be considered as a 
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globally unique tag ID. Conversely, it may also be convenient to introduce structure into 
the tag ID, for example to define the x and y coordinates of the tag. A 90-bit region ID 
allows 2 90 (~10 27 or a thousand trillion trillion) different regions to be uniquely 
identified. Tags may also contain type information, and a region may be tagged with a 
5 mixture of tag types. For example, a region may be tagged with one set of tags encoding 
x coordinates and another set, interleaved with the first, encoding y coordinates. It will 
be appreciated the region ID and tag ID precision may be more or less than just described 
depending on the environment in which the system will be used. 

1.2.2 Tag Data Encoding 

10 In one embodiment each tag contains 120 bits of information. The 120 bits of 

tag data are redundantly encoded using a (15, 5) Reed-Solomon code. This yields 360 
encoded bits consisting of 6 codewords of 15 4-bit symbols each. The (15, 5) code 
allows up to 5 symbol errors to be corrected per codeword, i.e. it is tolerant of a symbol 
error rate of up to 33% per codeword. 

1 5 Each 4-bit symbol is represented in a spatially coherent way in the tag, and the 

symbols of the six codewords are interleaved spatially within the tag. This ensures that a 
burst error (an error affecting multiple spatially adjacent bits) damages a minimum 
number of symbols overall and a minimum number of symbols in any one codeword, 
thus maximizing the likelihood that the burst error can be fully corrected. 

20 Any suitable error-correcting code code can be used in place of a (15, 5) Reed- 

Solomon code, for example a Reed-Solomon code with more or less redundancy, with 
the same or different symbol and codeword sizes; another block code; or a different kind 
of code, such as a convolutional code (see, for example, Stephen B. Wicker, Error 
Control Systems for Digital Communication and Storage, Prentice-Hall 1995, the 

25 contents of which a herein incorporated by cross-reference). 

1.2.3 Physical Tag Structure 

The physical representation of the tag, shown in Figure 5, includes fixed target 

structures 15, 16, 17 and variable data areas 18. The fixed target structures allow a 
sensing device such as the netpage pen to detect the tag and infer its three-dimensional 
30 orientation relative to the sensor. The data areas contain representations of the individual 
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bits of the encoded tag data. 

To achieve proper tag reproduction, the tag is rendered at a resolution of 
256x256 dots. When printed at 1600 dots per inch this yields a tag with a diameter of 
about 4 mm. At this resolution the tag is designed to be surrounded by a "quiet area" of 
5 radius 16 dots. Since the quiet area is also contributed by adjacent tags, it only adds 16 
dots to the effective diameter of the tag. 

The tag includes six target structures. A detection ring 15 allows the sensing 
device to initially detect the tag. The ring is easy to detect because it is rotationally 
invariant and because a simple correction of its aspect ratio removes most of the effects 
10 of perspective distortion. An orientation axis 16 allows the sensing device to determine 
, 3 the approximate planar orientation of the tag due to the yaw of the sensor. The 

^ orientation axis is skewed to yield a unique orientation. Four perspective targets 17 allow 

I U the sensing device to infer an accurate two-dimensional perspective transform of the tag 

m and hence an accurate three-dimensional position and orientation of the tag relative to 

; ^ 15 the sensor. 

i± All target structures are redundantly large to improve their immunity to noise. 

■jS The overall tag shape is circular. This supports, amongst other things, optimal 

tag packing on an irregular triangular grid, such as is required to tile an arbitrary non- 
planar surface. The tags may, however, be arranged at the apexes of any polygon having 

20 n apexes, where n ranges from 3 to infinity, as desired. In combination with the circular 
detection ring 15, this makes a circular arrangement of data bits within the tag optimal. 
As shown in Figure 48, to maximize its size, each data bit is represented by a radial 
wedge 510 in the form of an area bounded by two radial lines 512, a radially inner arc 
514 and a radially outer arc 516. Each wedge 510 has a minimum dimension of 8 dots at 

25 1600 dpi and is designed so that its base (i.e. its inner arc 514), is at least equal to this 
minimum dimension. The radial height of the wedge 510 is always equal to the 
minimum dimension. Each 4-bit data symbol is represented by an array 518 of 2x2 
wedges 510, as best shown in Figure 48. 

The 15 4-bit data symbols of each of the six codewords are allocated to the four 
30 concentric symbol rings 18a to 18d, shown in Figure 5, in interleaved fashion as shown 
in Figures 49 to 51. Symbols of first to sixth codewords 520-525 are allocated alternately 
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in circular progression around the tag. 

The interleaving is designed to maximize the average spatial distance between 
any two symbols of the same codeword. Other arrangements of the codewords or their 
data symbols may be utilized. 

5 The physical layout of the tags or the shape and/or arrangement of data symbols 

within each tag are nor essential to the working of the invention. It is merely necessary 
that each tag encode sufficient information for the intended use. The use of redundancy 
in the tag is preferred but, at its basic level, not truly essential to the working of the 
invention. As such other tag arrangements may be utilized. Examples of other tag 
10 structures are described in US Patents 5,625,412, 5,661,506, 5,477,012 and 5,852,434, 
and PCT application PCT/US98/20597, the contents of each of which are incorporated 
herein by reference. 

In ordier to support "single-click" interaction with a tagged region via a sensing 
device, the sensing device must be able to see at least one entire tag in its field of view 
15 no matter where in the region or at what orientation the sensing device is positioned. The 
required diameter of the field of view of the sensing device is therefore a function of the 
size and spacing of the tags. 

Assuming a circular tag shape, the minimum diameter of the sensor field of 
view is obtained when the tags are tiled on a equilateral triangular grid, as shown in 
20 Figure 6. 

1.2.4 Tag Image Processing and Decoding 

The tag image processing and decoding of a tag of figure 5 performed by a 

sensing device such as the netpage pen is shown in Figure 7. While a captured image is 
being acquired from the image sensor, the dynamic range of the image is determined (at 
25 20). The center of the range is then chosen as the binary threshold for the image 21 . The 
image is then thresholded and segmented into connected pixel regions (i.e. shapes 23) (at 
22). Shapes which are too small to represent tag target structures are discarded. The size 
and centroid of each shape is also computed. 

Binary shape moments 25 are then computed (at 24) for each shape, and these 
30 provide the basis for subsequently locating target structures. Central shape moments are 
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by their nature invariant of position, and can be easily made invariant of scale, aspect 
ratio and rotation. 

The ring target structure 15 is the first to be located (at 26). A ring has the 
advantage of being very well behaved when perspective-distorted. Matching proceeds by 
5 aspect-normalizing and rotation-normalizing each shape's moments. Once its second- 
order moments are normalized the ring is easy to recognize even if the perspective 
distortion was significant. The ring's original aspect and rotation 27 together provide a 
useful approximation of the perspective transform. 

The axis target structure 16 is the next to be located (at 28). Matching proceeds 
10 by applying the ring's normalizations to each shape's moments, and rotation-normalizing 
the resulting moments. Once its second-order moments are normalized the axis target is 
easily recognized. Note that one third order moment is required to disambiguate the two 
possible orientations of the axis. The shape is deliberately skewed to one side to make 
this possible. Note also that it is only possible to rotation-normalize the axis target after 
15 it has had the ring's normalizations applied, since the perspective distortion can hide the 
axis target's axis. The axis target's original rotation provides a useful approximation of 
the tag's rotation due to pen yaw 29. 

The four perspective target structures 17 are the last to be located (at 30). Good 
estimates of their positions are computed based on their known spatial relationships to 

20 the ring and axis targets, the aspect and rotation of the ring, and the rotation of the axis. 
Matching proceeds by applying the ring's normalizations to each shape's moments. Once 
their second-order moments are normalized the circular perspective targets are easy to 
recognize, and the target closest to each estimated position is taken as a match. The 
original centroids of the four perspective targets are then taken to be the perspective- 

25 distorted corners 3 1 of a square of known size in tag space, and an eight-degree-of- 
freedom perspective transform 33 is inferred (at 32) based on solving the well- 
understood equations relating the four tag-space and image-space point pairs (see 
Heckbert, P., Fundamentals of Texture Mapping and Image Warping, Masters Thesis, 
Dept. of EECS, U. of California at Berkeley, Technical Report No. UCB/CSD 89/516, 

30 June 1989, the contents of which are herein incorporated by cross-reference). 

The inferred tag-space to image-space perspective transform is used to project 
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(at 36) each known data bit position in tag space into image space where the real- valued 
position is used to bilinearly interpolate (at 36) the four relevant adjacent pixels in the 
input image. The previously computed image threshold 21 is used to threshold the result 
to produce the final bit value 37. 

5 Once all 360 data bits 37 have been obtained in this way, each of the six 60-bit 

Reed-Solomon codewords is decoded (at 38) to yield 20 decoded bits 39, or 120 decoded 
bits in total. Note that the codeword symbols are sampled in codeword order, so that 
codewords are implicitly de-interleaved during the sampling process. 

As mentioned above, the physical tag structure or encoding system is not 
1 0 essential to the invention and other physical arrangements of each tag may be used. It 
will be understood that the process for recognizing and decoding the tag image to 
retrieve the data encoded depends on the physical structure of the tag and the system 
used for redundantly encoding the data. 

The ring target 1 5 is only sought in a subarea of the image whose relationship 
15 to the image guarantees that the ring, if found, is part of a complete tag. If a complete tag 
is not found and successfully decoded, then no pen position is recorded for the current 
frame. Given adequate processing power and ideally a non-minimal field of view 193, an 
alternative strategy involves seeking another tag in the current image. 

The obtained tag data indicates the identity of the region containing the tag and 
20 the position of the tag within the region. An accurate position 35 of the pen nib in the 
region, as well as the overall orientation 35 of the pen, is then inferred (at 34) from the 
perspective transform 33 observed on the tag and the known spatial relationship between 
the pen's physical axis and the pen's optical axis. 

1.2.5 Alternative Tag Structures 

25 The tag structure just described is designed to allow both regular tilings of 

planar surfaces and irregular tilings of non-planar surfaces. Regular tilings are not, in 
general, possible on non-planar surfaces. In the more usual case of planar surfaces where 
regular tilings of tags are possible, i.e. surfaces such as sheets of paper and the like, more 
efficient tag structures can be used which exploit the regular nature of the tiling. 

30 An alternative tag structure more suited to a regular tiling is shown in Figure 
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6a. The alternative tag 4 is square and has four perspective targets 17. It is similar in 
structure to tags described by Bennett et al. in US Patent 5051746. The tag represents 
sixty 4-bit Reed-Solomon symbols 47, for a total of 240 bits. The tag represents each one 
bit as a dot 48, and each zero bit by the absence of the corresponding dot. The 
5 perspective targets are designed to be shared between adjacent tags, as shown in Figures 
6b and 6c. Figure 6b shows a square tiling of 16 tags and the corresponding minimum 
field of view 193, which must span the diagonals of two tags. Figure 6c shows a square 
tiling of nine tags, containing all one bits for illustration purposes. 

Using a (15, 7) Reed-Solomon code, 112 bits of tag data are redundantly 
10 encoded to produce 240 encoded bits. The four codewords are interleaved spatially 
"3 within the tag to maximize resilience to burst errors. Assuming a 16-bit tag ID as before, 

%j this allows a region ID of up to 92 bits. 

I ll The data-bearing dots 48 of the tag are designed to not overlap their neighbors, 

10 so that groups of tags cannot produce structures which resemble targets. This also saves 

~ 1 5 ink. The perspective targets therefore allow detection of the tag, so further targets are not 

required. Tag image processing proceeds as described in section 1.2.4 above, with the 

il: si* 

I U exception that steps. 26 and 28 are omitted. 

o Although the tag may contain an orientation feature to allow disambiguation of 

the four possible orientations of the tag relative to the sensor, it is also possible to embed 

20 orientation data in the tag data. For example, the four codewords can be arranged so that 
each tag orientation contains one codeword placed at that orientation, as shown in Figure 
6d, where each symbol is labelled with the number of its codeword (1-4) and the position 
of the symbol within the codeword (A-O). Tag decoding then consists of decoding one 
codeword at each orientation. Each codeword can either contain a single bit indicating 

25 whether it is the first codeword, or two bits indicating which codeword it is. The latter 
approach has the advantage that if, say, the data content of only one codeword is 
required, then at most two codewords need to be decoded to obtain the desired data. This 
may be the case if the region ID is not expected to change within a stroke and is thus 
only decoded at the start of a stroke. Within a stroke only the codeword containing the 

30 tag ID is then desired. Furthermore, since the rotation of the sensing device changes 
slowly and predictably within a stroke, only one codeword typically needs to be decoded 
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per frame. 

It is possible to dispense with perspective targets altogether and instead rely on 
the data representation being self-registering. In this case each bit value (or multi-bit 
value) is typically represented by an explicit glyph, i.e. no bit value is represented by the 
5 absence of a glyph. This ensures that the data grid is well-populated, and thus allows the 
grid to be reliably identified and its perspective distortion detected and subsequently 
corrected during data sampling. To allow tag boundaries to be detected, each tag data 
must contain a marker pattern, and these must be redundantly encoded to allow reliable 
detection. The overhead of such marker patterns is similar to the overhead of explicit 
10 perspective targets. One such scheme uses dots positioned a various points relative to 
grid vertices to represent different glyphs and hence different multi-bit values (see Anoto 
Technology Description, Anoto April 2000). 

1.2.6 Tag Map 

Decoding a tag results in a region ID, a tag ID, and a tag-relative pen transform. 
15 Before the tag ID and the tag-relative pen location can be translated into an absolute 
location within the tagged region, the location of the tag within the region must be 
known. This is given by a tag map, a function which maps each tag ID in a tagged region 
to a corresponding location. The tag map class diagram is shown in Figure 22, as part of 
the netpage printer class diagram. 

20 A tag map reflects the scheme used to tile the surface region with tags, and this 

can vary according to surface type. When multiple tagged regions share the same tiling 
scheme and the same tag numbering scheme, they can also share the same tag map. 

The tag map for a region must be retrievable via the region ID. Thus, given a 
region ID, a tag ID and a pen transform, the tag map can be retrieved, the tag ID can be 
25 translated into an absolute tag location within the region, and the tag-relative pen 
location can be added to the tag location to yield an absolute pen location within the 
region. 

1.2.7 Tagging Schemes 

Two distinct surface coding schemes are of interest, both of which use the tag 

30 structure described earlier in this section. The preferred coding scheme uses "location- 
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indicating" tags as already discussed. An alternative coding scheme uses "object- 
indicating" tags. 

A location-indicating tag contains a tag ID which, when translated through the 
tag map associated with the tagged region, yields a unique tag location within the region. 
5 The tag-relative location of the pen is added to this tag location to yield the location of 
the pen within the region. This in turn is used to determine the location of the pen 
relative to a user interface element in the page description associated with the region. 
Not only is the user interface element itself identified, but a location relative to the user 
interface element is identified. Location-indicating tags therefore trivially support the 
10 capture of an absolute pen path in the zone of a particular user interface element. 

"5 An object-indicating tag contains a tag ID which directly identifies a user 

interface element in the page description associated with the region. All the tags in the 
1 U zone of the user interface element identify the user interface element, making them all 

m identical and therefore indistinguishable. Object-indicating tags do not, therefore, 

^ 15 support the capture of an absolute pen path. They do, however, support the capture of a 

I : ~ relative pen path. So long as the position sampling frequency exceeds twice the 

if U encountered tag frequency, the displacement from one sampled pen position to the next 

within a stroke can be unambiguously determined. 

i z -t 

With either tagging scheme, the tags function in cooperation with associated 
20 visual elements on the netpage as user interactive elements in that a user can interact 
with the printed page using an appropriate sensing device in order for tag data to be read 
by the sensing device and for an appropriate response to be generated in the netpage 
system. 

1 .3 Document and Page Descriptions 

25 A preferred embodiment of a document and page description class diagram is 

shown in Figures 25 and 26. 

In the netpage system a document is described at three levels. At the most 
abstract level the document 836 has a hierarchical structure whose terminal elements 839 
are associated with content objects 840 such as text objects, text style objects, image 
30 objects, etc. Once the document is printed on a printer with a particular page size and 
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according to a particular user's scale factor preference, the document is paginated and 
otherwise formatted. Formatted terminal elements 835 will in some cases be associated 
with content objects which are different from those associated with their corresponding 
terminal elements, particularly where the content objects are style-related. Each printed 
5 instance of a document and page is also described separately, to allow input captured 
through a particular page instance 830 to be recorded separately from input captured 
through other instances of the same page description. 

The presence of the most abstract document description on the page server 
allows a user to request a copy of a document without being forced to accept the source 
10 document's specific format. The user may be requesting a copy through a printer with a 
different page size, for example. Conversely, the presence of the formatted document 
description on the page server allows the page server to efficiently interpret user actions 
on a particular printed page. 

A formatted document 834 consists of a set of formatted page descriptions 5, 
15 each of which consists of a set of formatted terminal elements 835. Each formatted 
element has a spatial extent or zone 58 on the page. This defines the active area of input 
elements such as hyperlinks and input fields. 

A document instance 83 1 corresponds to a formatted document 834. It consists 
of a set of page instances 830, each of which corresponds to a page description 5 of the 
20 formatted document. Each page instance 830 describes a single unique printed netpage 1, 
and records the page ID 50 of the netpage. A page instance is not part of a document 
instance if it represents a copy of a page requested in isolation. 

A page instance consists of a set of terminal element instances 832. An element 
instance only exists if it records instance-specific information. Thus, a hyperlink instance 
25 exists for a hyperlink element because it records a transaction ID 55 which is specific to 
the page instance, and a field instance exists for a field element because it records input 
specific to the page instance. An element instance does not exist, however, for static 
elements such as textflows. 

A terminal element can be a static element 843, a hyperlink element 844, a 
30 field element 845 or a page server command element 846, as shown in Figure 27. A 
static element 843 can be a style element 847 with an associated style object 854, a 
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textflow element 848 with an associated styled text object 855, an image element 849 
with an associated image element 856, a graphic element 850 with an associated graphic 
object 857, a video clip element 851 with an associated video clip object 858, an audio 
clip element 852 with an associated audio clip object 859, or a script element 853 with 
5 an associated script object 860, as shown in Figure 28. 

A page instance may have a background field 833 which is used to record any 
digital ink captured on the page which does not apply to a specific input element. 

In the preferred form of the invention, a tag map 81 1 is associated with each 
page instance to allow tags on the page to be translated into locations on the page. 

10 1.4 The Netpage Network 

In a preferred embodiment, a netpage network consists of a distributed set of 

netpage page servers 10, netpage registration servers 11, netpage ID servers 12, netpage 
application servers 13, netpage publication servers 14, and netpage printers 601 
connected via a network 19 such as the Internet, as shown in Figure 3. 

15 The netpage registration server 11 is a server which records relationships 

between users, pens, printers, applications and publications, and thereby authorizes 
various network activities. It authenticates users and acts as a signing proxy on behalf of 
authenticated users in application transactions. It also provides handwriting recognition 
services if desired. As described above, a netpage page server 10 maintains persistent 

20 information about page descriptions and page instances. The netpage network includes 
any number of page servers, each handling a subset of page instances. Since a page 
server also maintains user input values for each page instance, clients such as netpage 
printers send netpage input directly to the appropriate page server. The page server 
interprets any such input relative to the description of the corresponding page. 

25 A netpage ID server 12 allocates document EDs 51 on demand, and provides 

load-balancing of page servers via its ID allocation scheme. 

A netpage printer uses the Internet Distributed Name System (DNS), or similar, 
to resolve a netpage page ID 50 into the network address of the netpage page server 
handling the corresponding page instance. 

30 A netpage application server 13 is a server which hosts interactive netpage 
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applications. A netpage publication server 14 is an application server which publishes 
netpage documents to netpage printers. They are described in detail in Section 2. 

Netpage servers can be hosted on a variety of network server platforms from 
manufacturers such as IBM, Hewlett-Packard, and Sun. Multiple netpage servers can run 
5 concurrently on a single host, and a single server can be distributed over a number of 
hosts. Some or all of the functionality provided by netpage servers, and in particular the 
functionality provided by the ID server and the page server, can also be provided directly 
in a netpage appliance such as a netpage printer, in a computer workstation, or on a local 
network. 

10 1 .5 The Netpage Printer 

The netpage printer 601 is an appliance which is registered with the netpage 

system and prints netpage documents on demand and via subscription. Each printer has a 
unique printer ID 62, and is connected to the netpage network via a network such as the 
Internet, ideally via a broadband connection. 

1 5 Apart from identity and security settings in non- volatile memory, the netpage 

printer need not contain any persistent storage. As far as a user is concerned, "the 
network is the computer". Netpages function interactively across space and time with the 
help of the distributed netpage page servers 10, independently of particular netpage 
printers. 

20 The netpage printer receives subscribed netpage documents from netpage 

publication servers 14. Each document is distributed in two parts: the page layouts, and 
the actual text and image objects which populate the pages. Because of personalization, 
page layouts are typically specific to a particular subscriber and so are pointcast to the 
subscriber's printer via the appropriate page server. Text and image objects, on the other 

25 hand, are typically shared with other subscribers, and so are multicast to all subscribers' 
printers and the appropriate page servers. 

The netpage publication server optimizes the segmentation of document 
content into pointcasts and multicasts. After receiving the pointcast of a document's page 
layouts, the printer knows which multicasts, if any, to listen to. 

30 Once the printer has received the complete page layouts and objects that define 
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the document to be printed, it can print the document. 

The printer rasterizes and prints odd and even pages simultaneously on both 
sides of the sheet. It contains duplexed print engine controllers 760 and print engines 
utilizing Memjet™ printheads 350 for this purpose. 

5 The printing process consists of two decoupled stages: rasterization of page 

descriptions, and expansion and printing of page images. The raster image processor 
(RIP) consists of one or more standard DSPs 757 running in parallel. The duplexed print 
engine controllers consist of custom processors which expand, dither and print page 
images in real time, synchronized with the operation of the printheads in the print 
10 engines. 

$\ 

I ;i Printers not enabled for invisible IR printing have the option to print tags using 

I U IR-absorptive black ink, although this restricts tags to otherwise empty areas of the page. 

O Although such pages have more limited functionality than invisible IR-printed pages, 

^ they are still classed as netpages. 

15 A normal netpage printer prints netpages on sheets of paper. More specialized 

I* netpage printers may print onto more specialized surfaces, such as globes or sheets of 

Iff plastics. Each printer supports at least one surface type, and supports at least one tag 

1-5 tiling scheme, and hence tag map, for each surface type. The tag map 811 which 

describes the tag tiling scheme actually used to print a document becomes associated with 

20 that document so that the document's tags can be correctly interpreted. 

Figure 2 shows the netpage printer class diagram, reflecting printer-related 
information maintained by a registration server 1 1 on the netpage network. 

A preferred embodiment of the netpage printer is described in greater detail in 
Section 6 below, with reference to Figures 11 to 16. 

25 1.5.1 Memjet™ Printheads 

The netpage system can operate using printers made with a wide range of 

digital printing technologies, including thermal inkjet, piezoelectric inkjet, laser 
electrophotographic, and others. However, for wide consumer acceptance, it is desirable 
that a netpage printer have the following characteristics: 
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• photographic quality color printing 

• high quality text printing 

• high reliability 

• low printer cost 
5 • low ink cost 

• low paper cost 

• simple operation 

• nearly silent printing 

• high printing speed 

10 • simultaneous double sided printing 

• compact form factor 

• low power consumption 

No currently commercially available printing technology has all of these 
characteristics. 

15 To enable production of printers with these characteristics, the present 

applicant has invented a new print technology, referred to as Memjet™ technology. 
Memjet™ is a drop-on-demand inkjet technology that incorporates page width printheads 
fabricated using microelectromechanical systems (MEMS) technology. Figure 17 shows 
a single printing element 300 of a Memjet™ printhead. The netpage wallprinter 

20 incorporates 168960 printing elements 300 to form a 1600 dpi pagewidth duplex printer. 
This printer simultaneously prints cyan, magenta, yellow, black, and infrared inks as well 
as paper conditioner and ink fixative. 

The printing element 300 is approximately 110 microns long by 32 microns 
wide. Arrays of these printing elements are formed on a silicon substrate 301 that 
25 incorporates CMOS logic, data transfer, timing, and drive circuits (not shown). 

Major elements of the printing element 300 are the nozzle 302, the nozzle rim 
303, the nozzle chamber 304, the fluidic seal 305, the ink channel rim 306, the lever arm 
307, the active actuator beam pair 308, the passive actuator beam pair 309, the active 
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actuator anchor 310, the passive actuator anchor 31 1, and the ink inlet 312. 

The active actuator beam pair 308 is mechanically joined to the passive 
actuator beam pair 309 at the join 319. Both beams pairs are anchored at their respective 
anchor points 310 and 311. The combination of elements 308, 309, 310, 311, and 319 
5 form a cantilevered electrothermal bend actuator 320. 

Figure 18 shows a small part of an array of printing elements 300, including a 
cross section 315 of a printing element 300. The cross section 315 is shown without ink, 
to clearly show the ink inlet 312 that passes through the silicon wafer 301 . 

Figures 19(a), 19(b) and 19(c) show the operating cycle of a Memjet™ printing 
10 element 300. 

Figure 19(a) shows the quiescent position of the ink meniscus 316 prior to 
printing an ink droplet. Ink is retained in the nozzle chamber by surface tension at the ink 
meniscus 316 and at the fluidic seal 305 formed between the nozzle chamber 304 and the 
ink channel rim 306. 

15 While printing, the printhead CMOS circuitry distributes data from the print 

engine controller to the correct printing element, latches the data, and buffers the data to 
drive the electrodes 318 of the active actuator beam pair 308. This causes an electrical 
current to pass through the beam pair 308 for about one microsecond, resulting in Joule 
heating. The temperature increase resulting from Joule heating causes the beam pair 308 

20 to expand. As the passive actuator beam pair 309 is not heated, it does not expand, 
resulting in a stress difference between the two beam pairs. This stress difference is 
partially resolved by the cantilevered end of the electrothermal bend actuator 320 
bending towards the substrate 301. The lever arm 307 transmits this movement to the 
nozzle chamber 304. The nozzle chamber 304 moves about two microns to the position 

25 shown in Figure 19(b). This increases the ink pressure, forcing ink 321 out of the nozzle 
302, and causing the ink meniscus 316 to bulge. The nozzle rim 303 prevents the ink 
meniscus 316 from spreading across the surface of the nozzle chamber 304. 

As the temperature of the beam pairs 308 and 309 equalizes, the actuator 320 
returns to its original position. This aids in the break-off of the ink droplet 3 1 7 from the 
30 ink 321 in the nozzle chamber, as shown in Figure 19(c). The nozzle chamber is refilled 
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by the action of the surface tension at the meniscus 316. 

Figure 20 shows a segment of a printhead 350. In a netpage printer, the length 
of the printhead is the full width of the paper (typically 210 mm) in the direction 351. 
The segment shown is 0.4 mm long (about 0.2% of a complete printhead). When 
5 printing, the paper is moved past the fixed printhead in the direction 352. The printhead 
has 6 rows of interdigitated printing elements 300, printing the six colors or types of ink 
supplied by the ink inlets 312. 

To protect the fragile surface of the printhead during operation, a nozzle guard 
wafer 330 is attached to the printhead substrate 301. For each nozzle 302 there is a 
10 corresponding nozzle guard hole 331 through which the ink droplets are fired. To 
3 prevent the nozzle guard holes 331 from becoming blocked by paper fibers or other 

debris, filtered air is pumped through the air inlets 332 and out of the nozzle guard holes 
1 U during printing. To prevent ink 321 from drying, the nozzle guard is sealed while the 

3 ! q printer is idle. 

15 1.6 The Netpage Pen 

l The active sensing device of the netpage system is typically a pen 101, which, 

i 'U using its embedded controller 134, is able to capture and decode IR position tags from a 

p page via an image sensor. The image sensor is a solid-state device provided with an 

' " appropriate filter to permit sensing at only near-infrared wavelengths. As described in 

20 more detail below, the system is able to sense when the nib is in contact with the surface, 
and the pen is able to sense tags at a sufficient rate to capture human handwriting (i.e. at 
200 dpi or greater and 100 Hz or faster). Information captured by the pen is encrypted 
and wirelessly transmitted to the printer (or base station), the printer or base station 
interpreting the data with respect to the (known) page, or, in the preferred embodiment, 
25 transmitting the information to a netpage server for interpretation. 

The preferred embodiment of the netpage pen operates both as a marking ink 
pen and as a non-marking stylus. The marking aspect, however, is not necessary for 
using the netpage system as a browsing system, such as when it is used as an Internet 
interface. Each netpage pen is registered with the netpage system and has a unique pen 
30 ID 61 . Figure 23 shows the netpage pen class diagram, reflecting pen-related information 
maintained by a registration server 1 1 on the netpage network. 
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When either nib is in contact with a netpage, the pen determines its position 
and orientation relative to the page. The nib is attached to a force sensor, and the force 
on the nib is interpreted relative to a threshold to indicate whether the pen is "up" or 
"down". This allows a interactive element on the page to be 'clicked' by pressing with 
5 the pen nib, in order to request, say, information from a network. Furthermore, the force 
is captured as a continuous value to allow, say, the full dynamics of a signature to be 
verified. The nib may be movable when subject to a specified force which is greater than 
that normally applied when writing. To "click" the user applies a force sufficient to 
move the nib. This may provide more desirable feedback to the user compared to that 
1 0 provided by a non-moving nib. 

The pen determines the position and orientation of its nib on the netpage by 
imaging, in the infrared spectrum, an area 193 of the page in the vicinity of the nib. It 
decodes the nearest tag and computes the position of the nib relative to the tag from the 
observed perspective distortion on the imaged tag and the known geometry of the pen 
1 5 optics. Although the position resolution of the tag may be low, because the tag density 
on the page is inversely proportional to the tag size, the adjusted position resolution is 
quite high, exceeding the minimum resolution required for accurate handwriting 
recognition. 

Pen actions relative to a netpage are captured as a series of strokes. A stroke 
20 consists of a sequence of time-stamped pen positions on the page, initiated by a pen- 
down event and completed by the subsequent pen-up event. A stroke is also tagged with 
the page ID 50 of the netpage whenever the page ID changes, which, under normal 
circumstances, is at the commencement of the stroke. 

Each netpage pen has a current selection 826 associated with it, allowing the 
25 user to perform copy and paste operations etc. The selection is time-stamped to allow the 
system to discard it after a defined time period. The current selection describes a region 
of a page instance. It consists of the most recent digital ink stroke captured through the 
pen relative to the background area of the page. It is interpreted in an application-specific 
manner once it is submitted to an application via a selection hyperlink activation. 

30 Each pen has a current nib 824. This is the nib last notified by the pen to the 

system. In the case of the default netpage pen described above, either the marking ink nib 
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or the non-marking stylus nib is current. Each pen also has a current nib style 825. This 
is the nib style last associated with the pen by an application, e.g. in response to the user 
selecting a color from a palette. The default nib style is the nib style associated with the 
current nib. Strokes captured through a pen are tagged with the current nib style. When 
5 the strokes are subsequently reproduced, they are reproduced in the nib style with which 
they are tagged. 

Whenever the pen is within range of a printer with which it can communicate, 
the pen slowly flashes its "online" LED. When the pen fails to decode a stroke relative to 
the page, it momentarily activates its "error" LED. When the pen succeeds in decoding a 
1 0 stroke relative to the page, it momentarily activates its "ok" LED. 

A sequence of captured strokes is referred to as digital ink. Digital ink forms 
the basis for the digital exchange of drawings and handwriting, for online recognition of 
handwriting, and for online verification of signatures. 

The pen is wireless and transmits digital ink to the netpage printer via a short- 
15 range radio link. The transmitted digital ink is encrypted for privacy and security and 
packetized for efficient transmission, but is always flushed on a pen-up event to ensure 
timely handling in the printer. 

When the pen is out of range of a printer it buffers digital ink in internal 
memory, which has a capacity of over ten minutes of continuous handwriting. When the 
20 pen is once again within range of a printer, it transfers any buffered digital ink. The 
buffer may provide more or less buffer capacity. 

A pen can be registered with any number of printers, but because all state data 
resides in netpages both on paper and on the network, it is largely immaterial which 
printer a pen is communicating with at any particular time. 

25 A preferred embodiment of the pen is described in greater detail in Section 6 

below, with reference to Figures 8 to 10. 

1.7 Netpage Interaction 

The netpage printer 601 receives data relating to a stroke from the pen 101 

when the pen is used to interact with a netpage 1 . The coded data 3 of the tags 4 is read 
30 by the pen when it is used to execute a movement, such as a stroke. The data allows the 
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identity of the particular page and associated interactive element to be determined and an 
indication of the relative positioning of the pen relative to the page to be obtained. The 
indicating data is transmitted to the printer, where it resolves, via the DNS, the page ID 
50 of the stroke into the network address of the netpage page server 10 which maintains 
5 the corresponding page instance 830. It then transmits the stroke to the page server. If the 
page was recently identified in an earlier stroke, then the printer may already have the 
address of the relevant page server in its cache. Each netpage consists of a compact page 
layout maintained persistently by a netpage page server (see below). The page layout 
refers to objects such as images, fonts and pieces of text, typically stored elsewhere on 
1 0 the netpage network. 

When the page server receives the stroke from the pen, it retrieves the page 
description to which the stroke applies, and determines which element of the page 
description the stroke intersects. It is then able to interpret the stroke in the context of the 
type of the relevant element. 

15 A "click" is typically a stroke where the distance and time between the pen 

down position and the subsequent pen up position are both less than some small 
maximum. An object which is activated by a click typically requires a click to be 
activated, and accordingly, a longer stroke is ignored. The failure of a pen action, such as 
a "sloppy" click, to register is indicated by the lack of response from the pen's "ok" 

20 LED. However, where a netpage includes a button a "click" can be registered when both 
the pen down and pen up positions are both within the area of the button. 

There are two kinds of input elements in a netpage page description: hyperlinks 
and form fields. Input through a form field can also trigger the activation of an associated 
hyperlink. 

25 1.7.1 Hyperlinks 

A hyperlink is a means of sending a message to a remote application, and 

typically elicits a printed response in the netpage system. 

A hyperlink element 844 identifies the application 71 which handles activation 
of the hyperlink, a link ID 54 which identifies the hyperlink to the application, an "alias 
30 required" flag which asks the system to include the user's application alias ED 65 in the 
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hyperlink activation, and a description which is used when the hyperlink is recorded as a 
favorite or appears in the user's history. The hyperlink element class diagram is shown in 
Figure 29. 

When a hyperlink is activated, the page server sends a request to an application 
5 somewhere on the network. The application is identified by an application ID 64, and the 
application ID is resolved in the normal way via the DNS. There are three types of 
hyperlinks: general hyperlinks 863, form hyperlinks 865, and selection hyperlinks 864, 
as shown in Figure 30. A general hyperlink can implement a request for a linked 
document, or may simply signal a preference to a server. A form hyperlink submits the 
10 corresponding form to the application. A selection hyperlink submits the current 
selection to the application. If the current selection contains a single-word piece of text, 
for example, the application may return a single-page document giving the word's 
meaning within the context in which it appears, or a translation into a different language. 
Each hyperlink type is characterized by what information is submitted to the application. 

15 The corresponding hyperlink instance 862 records a transaction ID 55 which 

can be specific to the page instance on which the hyperlink instance appears. The 
transaction ID can identify user-specific data to the application, for example a "shopping 
cart" of pending purchases maintained by a purchasing application on behalf of the user. 

The system includes the pen's current selection 826 in a selection hyperlink 
20 activation. The system includes the content of the associated form instance 868 in a form 
hyperlink activation, although if the hyperlink has its "submit delta" attribute set, only 
input since the last form submission is included. The system includes an effective return 
path in all hyperlink activations. 

A hyperlinked group 866 is a group element 838 which has an associated 
25 hyperlink, as shown in Figure 31. When input occurs through any field element in the 
group, the hyperlink 844 associated with the group is activated. A hyperlinked group can 
be used to associate hyperlink behavior with a field such as a checkbox. It can also be 
used, in conjunction with the "submit delta" attribute of a form hyperlink, to provide 
continuous input to an application. It can therefore be used to support a "blackboard" 
30 interaction model, i.e. where input is captured and therefore shared as soon as it occurs. 
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1.7.2 Forms 

A form defines a collection of related input fields used to capture a related set 
of inputs through one or more printed netpages. A form allows a user to submit one or 
more parameters to an application software program running on a server. 

5 A form 867 is a group element 838 in the document hierarchy. It ultimately 

contains a set of terminal field elements 839. A form instance 868 represents a printed 
instance of a form. It includes a set of field instances 870 which correspond to the field 
elements 845 of the form. Each field instance has an associated value 871, whose type 
depends on the type of the corresponding field element. Each field value records input 
1 0 through a particular printed form instance, i.e. through one or more printed netpages. The 
■p form class diagram is shown in Figure 32. 

4 Each form instance has a status 872 which indicates whether the form is active, 

f : [j frozen, submitted, void or expired. A form is active when first printed. A form becomes 

frozen once it is signed. A form becomes submitted once one of its submission 
M 15 hyperlinks has been activated, unless the hyperlink has its "submit delta" attribute set. A 

form becomes void when the user invokes a void form, reset form or duplicate form page 
^ command. A form expires when the time the form has been active exceeds the form's 

\J\ specified lifetime. While the form is active, form input is allowed. Input through a form 

rjj which is not active is instead captured in the background field 833 of the relevant page 

20 instance. When the form is active or frozen, form submission is allowed. Any attempt to 

submit a form when the form is not active or frozen is rejected, and instead elicits a form 

status report. 

Each form instance is preferably associated (at 59) with any form instances 
derived from it, thus providing a version history. This allows all but the latest version of 
25 a form in a particular time period to be excluded from a search. 

All input is captured as digital ink. Digital ink 873 consists of a set of time- 
stamped stroke groups 874, each of which consists of a set of styled strokes 875. Each 
stroke consists of a set of time-stamped pen positions 876, each of which also includes 
pen orientation and nib force. The digital ink class diagram is shown in Figure 33. 

30 A field element 845 can be a checkbox field 877, a text field 878, a drawing 

field 879, or a signature field 880. The field element class diagram is shown in Figure 
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34. Any digital ink captured in a field's zone 58 is assigned to the field. 

A checkbox field has an associated Boolean value 881, as shown in Figure 35. 
Any mark (a tick, a cross, a stroke, a fill zigzag, etc.) captured in a checkbox field's zone 
causes a true value to be assigned to the field's value. 

5 A text field has an associated text value 882, as shown in Figure 36. Any digital 

ink captured in a text field's zone is automatically converted to text via online 
handwriting recognition, and the text is assigned to the field's value. Online handwriting 
recognition is well-understood (see, for example, Tappert, C, C.Y. Suen and T. 
Wakahara, "The State of the Art in On-Line Handwriting Recognition", IEEE 
10 Transactions on Pattern Analysis and Machine Intelligence, Vol.12, No.8, August 1990, 
the contents of which are herein incorporated by cross-reference). Specializations of text 
fields include date and number fields. 

A signature field has an associated digital signature value 883, as shown in 
Figure 37. Any digital ink captured in a signature field's zone is automatically verified 

1 5 with respect to the identity of the owner of the pen, and a digital signature of the content 
of the form of which the field is part is generated and assigned to the field's value. The 
digital signature is generated using the pen user's private signature key specific to the 
application which owns the form. Online signature verification is well-understood (see, 
for example, Plamondon, R. and G. Lorette, "Automatic Signature Verification and 

20 Writer Identification - The State of the Art", Pattern Recognition, Vol.22, No.2, 1989, 
the contents of which are herein incorporated by cross-reference). 

A field element is hidden if its "hidden" attribute is set. A hidden field element 
does not have an input zone on a page and does not accept input. It can have an 
associated field value which is included in the form data when the form containing the 
25 field is submitted. 

"Editing" commands, such as strike-throughs indicating deletion, can also be 
recognized in form fields. 

Because the handwriting recognition algorithm works "online" (i.e. with access 
to the dynamics of the pen movement), rather than "offline" (i.e. with access only to a 
30 bitmap of pen markings), it can recognize run-on discretely-written characters with 
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relatively high accuracy, without a writer-dependent training phase. A writer-dependent 
model of handwriting is automatically generated over time, however, and can be 
generated up-front if necessary, 

Digital ink, as already stated, consists of a sequence of strokes. Any stroke 
5 which starts in a particular element's zone is appended to that element's digital ink 
stream, ready for interpretation. Any stroke not appended to an object's digital ink 
stream is appended to the background field's digital ink stream. 

Digital ink captured in the background field is interpreted as a selection 
gesture. Circumscription of one or more objects is generally interpreted as a selection of 
10 the circumscribed objects, although the actual interpretation is application-specific. 

Table 2 summarizes these various pen interactions with a netpage. 



Table 2 - Summary of pen interactions with a netpage 



Object 


Type 


Pen input 


Action 


Hyperlink 


General 


Click 


Submit action to application 


Form 


Click 


Submit form to application 


Selection 


Click 


Submit selection to application 


Form field 


Checkbox 


Any mark 


Assign true to field 


Text 


Handwriting 


Convert digital ink to text; assign 
text to field 


Drawing 


Digital ink 


Assign digital ink to field 


Signature 


Signature 


Verify digital ink signature; 
generate digital signature of form; 
assign digital signature to field 


None 




Circumscription 


Assign digital ink to current 
selection 



1 5 The system maintains a current selection for each pen. The selection consists 

simply of the most recent stroke captured in the background field. The selection is 
cleared after an inactivity timeout to ensure predictable behavior. 

The raw digital ink captured in every field is retained on the netpage page 
server and is optionally transmitted with the form data when the form is submitted to the 
20 application. This allows the application to interrogate the raw digital ink should it 
suspect the original conversion, such as the conversion of handwritten text. This can, for 
example, involve human intervention at the application level for forms which fail certain 
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application-specific consistency checks. As an extension to this, the entire background 
area of a form can be designated as a drawing field. The application can then decide, on 
the basis of the presence of digital ink outside the explicit fields of the form, to route the 
form to a human operator, on the assumption that the user may have indicated 
5 amendments to the filled-in fields outside of those fields. 

Figure 38 shows a flowchart of the process of handling pen input relative to a 
netpage. The process consists of receiving (at 884) a stroke from the pen; identifying (at 

885) the page instance 830 to which the page ID 50 in the stroke refers; retrieving (at 

886) the page description 5; identifying (at 887) a formatted element 839 whose zone 58 
10 the stroke intersects; determining (at 888) whether the formatted element corresponds to 

^3 a field element, and if so appending (at 892) the received stroke to the digital ink of the 

HJ field value 871, interpreting (at 893) the accumulated digital ink of the field, and 

I ~* determining (at 894) whether the field is part of a hyperlinked group 866 and if so 

^ activating (at 895) the associated hyperlink; alternatively determining (at 889) whether 

% 5 1 5 the formatted element corresponds to a hyperlink element and if so activating (at 895) 

j ^ the corresponding hyperlink; alternatively, in the absence of an input field or hyperlink, 

I* appending (at 890) the received stroke to the digital ink of the background field 833; and 

j'ij 

In copying (at 891) the received stroke to the current selection 826 of the current pen, as 

•i'Z maintained by the registration server. 

20 Figure 38a shows a detailed flowchart of step 893 in the process shown in 

Figure 38, where the accumulated digital ink of a field is interpreted according to the 
type of the field. The process consists of determining (at 896) whether the field is a 
checkbox and (at 897) whether the digital ink represents a checkmark, and if so 
assigning (at 898) a true value to the field value; alternatively determining (at 899) 

25 whether the field is a text field and if so converting (at 900) the digital ink to computer 
text, with the help of the appropriate registration server, and assigning (at 901) the 
converted computer text to the field value; alternatively determining (at 902) whether the 
field is a signature field and if so verifying (at 903) the digital ink as the signature of the 
pen's owner, with the help of the appropriate registration server, creating (at 904) a 

30 digital signature of the contents of the corresponding form, also with the help of the 
registration server and using the pen owner's private signature key relating to the 
corresponding application, and assigning (at 905) the digital signature to the field value. 
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1 .7.3 Page Server Commands 

A page server command is a command which is handled locally by the page 

server. It operates directly on form, page and document instances. 

A page server command 907 can be a void form command 908, a duplicate 
5 form command 909, a reset form command 910, a get form status command 911, a 
duplicate page command 912, a reset page command 913, a get page status command 
914, a duplicate document command 915, a reset document command 916, or a get 
document status command 917, as shown in Figure 39. 

A void form command voids the corresponding form instance. A duplicate 
10 form command voids the corresponding form instance and then produces an active 
printed copy of the current form instance with field values preserved. The copy contains 
the same hyperlink transaction IDs as the original, and so is indistinguishable from the 
original to an application. A reset form command voids the corresponding form instance 
and then produces an active printed copy of the form instance with field values 
1 5 discarded. The copy contains the same hyperlink transaction IDs as the original. A get 
form status command produces a printed report on the status of the corresponding form 
instance, including who published it, when it was printed, for whom it was printed, and 
the form status of the form instance. 

Since a form hyperlink instance contains a transaction ID, the application has to 
20 be involved in producing a new form instance. A button requesting a new form instance 
is therefore typically implemented as a hyperlink. 

A duplicate page command produces a printed copy of the corresponding page 
instance with the background field value preserved. If the page contains a form or is part 
of a form, then the duplicate page command is interpreted as a duplicate form command. 

25 A reset page command produces a printed copy of the corresponding page instance with 
the background field value discarded. If the page contains a form or is part of a form, 
then the reset page command is interpreted as a reset form command. A get page status 
command produces a printed report on the status of the corresponding page instance, 
including who published it, when it was printed, for whom it was printed, and the status 

30 of any forms it contains or is part of. 

The netpage logo which appears on every netpage is usually associated with a 
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duplicate page element in the preferred implementation of the invention. 

When a page instance is duplicated with field values preserved, field values are 
printed in their native form, i.e. a checkmark appears as a standard checkmark graphic, 
and text appears as typeset text. Only drawings and signatures appear in their original 
5 form, with a signature preferably accompanied by, or alternatively replaced by, a 
standard graphic indicating successful signature verification. 

A duplicate document command produces a printed copy of the corresponding 
document instance with background field values preserved. If the document contains any 
forms, then the duplicate document command duplicates the forms in the same way a 

10 duplicate form command does. A reset document command produces a printed copy of 
the corresponding document instance with background field values discarded. If the 
document contains any forms, then the reset document command resets the forms in the 
same way a reset form command does. A get document status command produces a 
printed report on the status of the corresponding document instance, including who 

15 published it, when it was printed, for whom it was printed, and the status of any forms it 
contains. 

If the page server command's "on selected" attribute is set, then the command 
operates on the page identified by the pen's current selection rather than on the page 
containing the command. This allows a menu of page server commands to be printed. If 
20 the target page doesn't contain a page server command element for the designated page 
server command, then the command is ignored. 

An application can provide application-specific handling by embedding the 
relevant page server command element in a hyperlinked group. The page server activates 
the hyperlink associated with the hyperlinked group rather than executing the page server 
25 command. 

A page server command element is hidden if its "hidden" attribute is set. A 
hidden command element does not have an input zone on a page and so cannot be 
activated directly by a user. It can, however, be activated via a page server command 
embedded in a different page, if that page server command has its "on selected" attribute 
30 set. 
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1 .8 Standard Features of Netpages 

In the preferred form, each netpage is printed with the netpage logo at the 

bottom to indicate that it is a netpage and therefore has interactive properties. The logo 
also acts as a copy button. In most cases "clicking" the logo produces a copy of the page. 
5 In the case of a form, the button produces a copy of the entire form. And in the case of a 
secure document, such as a ticket or coupon, the button elicits an explanatory note or 
advertising page. 

The default single-page copy function is handled directly by the relevant 
netpage page server. Special copy functions are handled by linking the logo button to an 
1 0 application. 



25 



1 .9 User Help System 

In a preferred embodiment, the netpage printer has a single button labeled 

"Help". When pressed it elicits a single page of information, including: 
status of printer connection 
15 • status of printer consumables 

top-level help menu 
document function menu 
top-level netpage network directory 
The help menu provides a hierarchical manual on how to use the netpage 

20 system. 

The document function menu includes the following functions: 

• print a copy of a document 

• print a clean copy of a form 

• print the status of a document 

A document function is initiated by simply pressing the button and then 
touching any page of the document. The status of a document indicates who published it 
and when, to whom it was delivered, and to whom and when it was subsequently 
submitted as a form. 
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The netpage network directory allows the user to navigate the hierarchy of 
publications and services on the network. As an alternative, the user can call the netpage 
network "900" number '"yellow pages" and speak to a human operator. The operator can 
locate the desired document and route it to the user's printer. Depending on the 
5 document type, the publisher or the user pays the small C4 yellow pages" service fee. 

The help page is obviously unavailable if the printer is unable to print. In this 
case the "error" light is lit and the user can request remote diagnosis over the network. 

2. Personalized Publication Model 

In the following description, news is used as a canonical publication example 

10 to illustrate personalization mechanisms in the netpage system. Although news is often 
used in the limited sense of newspaper and newsmagazine news, the intended scope in 
the present context is wider. 

In the netpage system, the editorial content and the advertising content of a 
news publication are personalized using different mechanisms. The editorial content is 
15 personalized according to the reader's explicitly stated and implicitly captured interest 
profile. The advertising content is personalized according to the reader's locality and 
demographic. 

2.1 Editorial Personalization 

A subscriber can draw on two kinds of news sources: those that deliver news 

20 publications, and those that deliver news streams. While news publications are 
aggregated and edited by the publisher, news streams are aggregated either by a news 
publisher or by a specialized news aggregator. News publications typically correspond to 
traditional newspapers and newsmagazines, while news streams can be many and varied: 
a "raw" news feed from a news service, a cartoon strip, a freelance writer's column, a 

25 friend's bulletin board, or the reader's own e-mail. 

The netpage publication server supports the publication of edited news 
publications as well as the aggregation of multiple news streams. By handling the 
aggregation and hence the formatting of news streams selected directly by the reader, the 
server is able to place advertising on pages over which it otherwise has no editorial 
30 control. 
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The subscriber builds a daily newspaper by selecting one or more contributing 
news publications, and creating a personalized version of each. The resulting daily 
editions are printed and bound together into a single newspaper. The various members of 
a household typically express their different interests and tastes by selecting different 
5 daily publications and then customizing them. 

For each publication, the reader optionally selects specific sections. Some 
sections appear daily, while others appear weekly. The daily sections available from The 
New York Times online, for example, include "Page One Plus", "National", 
"International", "Opinion", "Business", "Arts/Living", "Technology", and "Sports". The 
10 set of available sections is specific to a publication, as is the default subset. 

The reader can extend the daily newspaper by creating custom sections, each 
one drawing on any number of news streams. Custom sections might be created for e- 
mail and friends' announcements ("Personal"), or for monitoring news feeds for specific 
topics ("Alerts" or "Clippings"). 

15 For each section, the reader optionally specifies its size, either qualitatively 

(e.g. short, medium, or long), or numerically (i.e. as a limit on its number of pages), and 
the desired proportion of advertising, either qualitatively (e.g. high, normal, low, none), 
or numerically (i.e. as a percentage). 



20 articles or a small number of longer articles. Each article is ideally written (or edited) in 
both short and long forms to support this preference. 

An article may also be written (or edited) in different versions to match the 
expected sophistication of the reader, for example to provide children's and adults' 
versions. The appropriate version is selected according to the reader's age. The reader 
25 can specify a "reading age" which takes precedence over their biological age. 

The articles which make up each section are selected and prioritized by the 
editors, and each is assigned a useful lifetime. By default they are delivered to all 
relevant subscribers, in priority order, subject to space constraints in the subscribers' 
editions. 

30 In sections where it is appropriate, the reader may optionally enable 



The reader also optionally expresses a preference for a large number of shorter 



NPS028US 



-47- 

collaborative filtering. This is then applied to articles which have a sufficiently long 
lifetime. Each article which qualifies for collaborative filtering is printed with rating 
buttons at the end of the article. The buttons can provide an easy choice (e.g. "liked" and 
"disliked'), making it more likely that readers will bother to rate the article. 

5 Articles with high priorities and short lifetimes are therefore effectively 

considered essential reading by the editors and are delivered to most relevant 
subscribers. 

The reader optionally specifies a serendipity factor, either qualitatively (e.g. do 
or don't surprise me), or numerically. A high serendipity factor lowers the threshold used 
1 0 for matching during collaborative filtering. A high factor makes it more likely that the 
corresponding section will be filled to the reader's specified capacity. A different 
serendipity factor can be specified for different days of the week. 

The reader also optionally specifies topics of particular interest within a 
section, and this modifies the priorities assigned by the editors. 

1 5 The speed of the reader's Internet connection affects the quantity and quality at 

which images can be delivered. The reader optionally specifies a preference for fewer 
images or smaller images or both. If the number or size of images is not reduced, then 
images may be delivered at lower quality (i.e. at lower resolution or with greater 
compression). Alternatively all three of the quantity, size and quality of images delivered 

20 may be adjusted. 

At a global level, the reader specifies how quantities, dates, times and monetary 
values are localized. This involves specifying whether units are imperial or metric, a 
local time zone and time format, and a local currency, and whether the localization 
consist of in situ translation or annotation. These preferences are derived from the 
25 reader's locality by default. 

To reduce reading difficulties caused by poor eyesight, the reader optionally 
specifies a global preference for a larger presentation. Both text and images are scaled 
accordingly, and less information is accommodated on each page. 

The language in which a news publication is published, and its corresponding 
30 text encoding, is a property of the publication and not a preference expressed by the user. 
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However, the netpage system can be configured to provide automatic translation services 
in various guises. 

2.2 Advertising Localization and Targeting 

The personalization of the editorial content directly affects the advertising 

5 content, because advertising is typically placed to exploit the editorial context. Travel 
ads, for example, are more likely to appear in a travel section than elsewhere. The value 
of the editorial content to an advertiser (and therefore to the publisher) lies in its ability 
to attract large numbers of readers with the right demographics. 

Effective advertising is placed on the basis of locality and demographics. 
1 0 Locality determines proximity to particular services, retailers etc., and particular interests 
and concerns associated with the local community and environment. Demographics 
determine general interests and preoccupations as well as likely spending patterns. 

A news publisher's most profitable product is advertising "space", a multi- 
dimensional entity determined by the publication's geographic coverage, the size of its 
1 5 readership, its readership demographics, and the page area available for advertising. 

In the netpage system, the netpage publication server computes the 
approximate multi-dimensional size of a publication's saleable advertising space on a 
per-section basis, taking into account the publication's geographic coverage, the 
section's readership, the size of each reader's section edition, each reader's advertising 
20 proportion, and each reader's demographic. 

In comparison with other media, the netpage system allows the advertising 
space to be defined in greater detail, and allows smaller pieces of it to be sold separately. 
It therefore allows it to be sold at closer to its true value. 

For example, the same advertising "slot" can be sold in varying proportions to 
25 several advertisers, with individual readers' pages randomly receiving the advertisement 
of one advertiser or another, overall preserving the proportion of space sold to each 
advertiser. 

The netpage system allows advertising to be linked directly to detailed product 
information and online purchasing. It therefore raises the intrinsic value of the 
30 advertising space. 
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Because personalization and localization are handled automatically by netpage 
publication servers, an advertising aggregator can provide arbitrarily broad coverage of 
both geography and demographics. The subsequent disaggregation is efficient because it 
is automatic. This makes it more cost-effective for publishers to deal with advertising 
5 aggregators than to directly capture advertising. Even though the advertising aggregator 
is taking a proportion of advertising revenue, publishers may find the change profit- 
neutral because of the greater efficiency of aggregation. The advertising aggregator acts 
as an intermediary between advertisers and publishers, and may place the same 
advertisement in multiple publications. 

10 It is worth noting that ad placement in a netpage publication can be more 

complex than ad placement in the publication's traditional counterpart, because the 
publication's advertising space is more complex. While ignoring the full complexities of 
negotiations between advertisers, advertising aggregators and publishers, the preferred 
form of the netpage system provides some automated support for these negotiations, 

1 5 including support for automated auctions of advertising space. Automation is particularly 
desirable for the placement of advertisements which generate small amounts of income, 
such as small or highly localized advertisements. 

Once placement has been negotiated, the aggregator captures and edits the 
advertisement and records it on a netpage ad server. Correspondingly, the publisher 
20 records the ad placement on the relevant netpage publication server. When the netpage 
publication server lays out each user's personalized publication, it picks the relevant 
advertisements from the netpage ad server. 

2.3 User Profiles 

2.3.1 Information Filtering 

25 The personalization of news and other publications relies on an assortment of 

user-specific profile information, including: 

• publication customizations 

• collaborative filtering vectors 

• contact details 
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• presentation preferences 

The customization of a publication is typically publication-specific, and so the 
customization information is maintained by the relevant netpage publication server. 

A collaborative filtering vector consists of the user's ratings of a number of 
5 news items. It is used to correlate different users' interests for the purposes of making 
recommendations. Although there are benefits to maintaining a single collaborative 
filtering vector independently of any particular publication, there are two reasons why it 
is more practical to maintain a separate vector for each publication: there is likely to be 
more overlap between the vectors of subscribers to the same publication than between 
10 those of subscribers to different publications; and a publication is likely to want to 
present its users' collaborative filtering vectors as part of the value of its brand, not to be 
found elsewhere. Collaborative filtering vectors are therefore also maintained by the 
relevant netpage publication server. 

Contact details, including name, street address, ZIP Code, state, country, 
15 telephone numbers, are global by nature, and are maintained by a netpage registration 
server. 

Presentation preferences, including those for quantities, dates and times, are 
likewise global and maintained in the same way. 

The localization of advertising relies on the locality indicated in the user's 
20 contact details, while the targeting of advertising relies on personal information such as 
date of birth, gender, marital status, income, profession, education, or qualitative 
derivatives such as age range and income range. 

For those users who choose to reveal personal information for advertising 
purposes, the information is maintained by the relevant netpage registration server. In the 
25 absence of such information, advertising can be targeted on the basis of the demographic 
associated with the user's ZIP or ZEP+4 Code. 

Each user, pen, printer, application provider and application is assigned its own 
unique identifier, and the netpage registration server maintains the relationships between 
them, as shown in Figures 21, 22, 23 and 24. For registration purposes, a publisher is a 
30 special kind of application provider, and a publication is a special kind of application. 
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Each user 800 may be authorized to use any number of printers 802, and each 
printer may allow any number of users to use it. Each user has a single default printer (at 
66), to which periodical publications are delivered by default, whilst pages printed on 
demand are delivered to the printer through which the user is interacting. The server 
5 keeps track of which publishers a user has authorized to print to the user's default 
printer. A publisher does not record the ID of any particular printer, but instead resolves 
the ID when it is required. 

When a user subscribes 808 to a publication 807, the publisher 806 (i.e. 
application provider 803) is authorized to print to a specified printer or the user's default 
1 0 printer. This authorization can be revoked at any time by the user. Each user may have 
several pens 801, but a pen is specific to a single user. If a user is authorized to use a 
particular printer, then that printer recognizes any of the user's pens. 

The pen ID is used to locate the corresponding user profile maintained by a 
particular netpage registration server, via the DNS in the usual way. 

15 A Web terminal 809 can be authorized to print on a particular netpage printer, 

allowing Web pages and netpage documents encountered during Web browsing to be 
conveniently printed on the nearest netpage printer. 

The netpage system can collect, on behalf of a printer provider, fees and 
commissions on income earned through publications printed on the provider's printers. 
20 Such income can include advertising fees, click-through fees, e-commerce commissions, 
and transaction fees. If the printer is owned by the user, then the user is the printer 
provider. 

Each user also has a netpage account 820 which is used to accumulate micro- 
debits and credits (such as those described in the preceding paragraph); contact details 
25 815, including name, address and telephone numbers; global preferences 816, including 
privacy, delivery and localization settings; any number of biometric records 817, 
containing the user's encoded signature 818, fingerprint 819 etc; a handwriting model 
819 automatically maintained by the system; and SET payment card accounts 821 with 
which e-commerce payments can be made. 
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2.3.2 Favorites List 

A netpage user can maintain a list 922 of "favorites" - links to useful 

documents etc. on the netpage network. The list is maintained by the system on the 
user's behalf. It is organized as a hierarchy of folders 924, a preferred embodiment of 
5 which is shown in the class diagram in Figure 41 . 

2.3.3 History List 

The system preferably maintains a history list 929 on each user's behalf, 

containing links to documents etc. accessed by the user through the netpage system. It is 
organized as a date-ordered list, a preferred embodiment of which is shown in the class 
1 0 diagram in Figure 42. 

2.4 Intelligent Page Layout 

The netpage publication server automatically lays out the pages of each user's 

personalized publication on a section-by-section basis. Since most advertisements are in 
the form of pre-formatted rectangles, they are placed on the page before the editorial 
15 content. 

The advertising ratio for a section can be achieved with wildly varying 
advertising ratios on individual pages within the section, and the ad layout algorithm 
exploits this. The algorithm is configured to attempt to co-locate closely tied editorial 
and advertising content, such as placing ads for roofing material specifically within the 
20 publication because of a special feature on do-it-yourself roofing repairs. 

The editorial content selected for the user, including text and associated images 
and graphics, is then laid out according to various aesthetic rules. 

The entire process, including the selection of ads and the selection of editorial 
content, must be iterated once the layout has converged, to attempt to more closely 
25 achieve the user's stated section size preference. The section size preference can, 
however, be matched on average over time, allowing significant day-to-day variations. 

2.5 Document Format 

Once the document is laid out, it is encoded for efficient distribution and 

persistent storage on the netpage network. 
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The primary efficiency mechanism is the separation of information specific to a 
single user's edition and information shared between multiple users' editions. The 
specific information consists of the page layout. The shared information consists of the 
objects to which the page layout refers, including images, graphics, and pieces of text. 

5 A text object contains fully-formatted text, preferably represented in the 

Extensible Markup Language (XML) using the Extensible Stylesheet Language (XSL). 
XSL provides precise control over text formatting independently of the region into which 
the text is being set, which in this case is being provided by the layout. The text object 
contains embedded language codes to enable automatic translation, and embedded 
1 0 hyphenation hints to aid with paragraph formatting. 

I % An image object encodes an image, preferably in the JPEG 2000 wavelet-based 

y m compressed image format. A graphic object encodes a 2D graphic, preferably in Scalable 

f'LJ 

|y Vector Graphics (SVG) format. Other formats may be used for text, images and 

I J graphics. 

** 1 5 The layout itself consists of a series of placed image and graphic objects, linked 

J 7 textflow objects through which text objects flow, hyperlinks and input fields as 

I U described above, and watermark regions. These layout objects are summarized in Table 

^ 3. The layout uses a compact format suitable for efficient distribution and storage. 



20 Table 3 - netpage layout objects 



Layout 
object 


Attribute 


Format of 
linked object 


Image 


Position 




Image object ID 


JPEG 2000 


Graphic 


Position 




Graphic object ID 


SVG 


Textflow 


Textflow ID 




Zone 




Optional text object ID 


XML/XSL 


Hyperlink 


Type 




Zone 




Application ID, etc. 




Field 


Type 




Meaning 




Zone 




Watermark 


Zone 
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2.6 Document Distribution 

As described above, for purposes of efficient distribution and persistent storage 

on the netpage network, a user-specific page layout is separated from the shared objects 
5 to which it refers. 

When a subscribed publication is ready to be distributed, the netpage 
publication server allocates, with the help of the netpage ID server 12, a unique ID for 
each page, page instance, document, and document instance. 

The server computes a set of optimized subsets of the shared content and 
1 0 creates a multicast channel for each subset, and then tags each user-specific layout with 
the names of the multicast channels which will carry the shared content used by that 
layout. The server then pointcasts each user's layouts to that user's printer via the 
appropriate page server, and when the pointcasting is complete, multicasts the shared 
content on the specified channels. After receiving its pointcast, each page server and 
15 printer subscribes to the multicast channels specified in the page layouts. During the 
multicasts, each page server and printer extracts from the multicast streams those objects 
referred to by its page layouts. The page servers persistently archive the received page 
layouts and shared content. Other techniques for distributing the data may be used. 

Once a printer has received all the objects to which its page layouts refer, the 
20 printer re-creates the fully-populated layout and then rasterizes and prints it. 

Under normal circumstances, the printer prints pages faster than they can be 
delivered. Assuming a quarter of each page is covered with images, the average page has 
a size of less than 400KB. The printer can therefore hold in excess of 100 such pages in 
its internal 64MB memory, allowing for temporary buffers etc. The printer prints at a 
25 rate of one page per second. This is equivalent to 400KB or about 3Mbit of page data per 
second, which is similar to the highest expected rate of page data delivery over a 
broadband network. 

Even under abnormal circumstances, such as when the printer runs out of 
paper, it is likely that the user will be able to replenish the paper supply before the 
30 printer's 100-page internal storage capacity is exhausted. 
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However, if the printer's internal memory does fill up, then the printer will be 
unable to make use of a multicast when it first occurs. The netpage publication server 
therefore allows printers to submit requests for re-multicasts. When a critical number of 
requests is received or a timeout occurs, the server re-multicasts the corresponding 
5 shared objects. 

Once a document is printed, a printer can produce an exact duplicate at any 
time by retrieving its page layouts and contents from the relevant page server. 

2.7 Oim-Demand Documents 

When a netpage document is requested on demand, it can be personalized and 

10 delivered in much the same way as a periodical. However, since there is no shared 
J=3 content, delivery is made directly to the requesting printer without the use of multicast. 

.!,?? When a non-netpage document is requested on demand, it is not personalized, 

0 and it is delivered via a designated netpage formatting server which reformats it as a 
IQ netpage document. A netpage formatting server is a special instance of a netpage 

15 publication server. The netpage formatting server has knowledge of various Internet 

1 - document formats, including Adobe's Portable Document Format (PDF), and Hypertext 
in Markup Language (HTML). In the case of HTML, it can make use of the higher 
*;? resolution of the printed page to present Web pages in a multi-column format, with a 

table of contents. It can automatically include all Web pages directly linked to the 
20 requested page. The user can tune this behavior via a preference. 

The netpage formatting server makes standard netpage behavior, including 
interactivity and persistence, available on any Internet document, no matter what its 
origin and format. It hides knowledge of different document formats from both the 
netpage printer and the netpage page server, and hides knowledge of the netpage system 
25 from Web servers. 

3. Security 

3.1 Cryptography 

Cryptography is used to protect sensitive information, both in storage and in 

transit, and to authenticate parties to a transaction. There are two classes of cryptography 
30 in widespread use: secret-key cryptography and public-key cryptography. The netpage 
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network uses both classes of cryptography. 

Secret-key cryptography, also referred to as symmetric cryptography, uses the 
same key to encrypt and decrypt a message. Two parties wishing to exchange messages 
must first arrange to securely exchange the secret key. 

5 Public-key cryptography, also referred to as asymmetric cryptography, uses two 

encryption keys. The two keys are mathematically related in such a way that any message 
encrypted using one key can only be decrypted using the other key. One of these keys is 
then published, while the other is kept private. The public key is used to encrypt any 
message intended for the holder of the private key. Once encrypted using the public key, 
10 a message can only be decrypted using the private key. Thus two parties can securely 
]% exchange messages without first having to exchange a secret key. To ensure that the 

^ private key is secure, it is normal for the holder of the private key to generate the key 

I y 

|y pair. 

CO Public-key cryptography can be used to create a digital signature. The holder of 

. 3 i 15 the private key can create a known hash of a message and then encrypt the hash using the 

2..JL 

• r private key. Anyone can then verify that the encrypted hash constitutes the "signature" of 

I U the holder of the private key with respect to that particular message by decrypting the 

*;g encrypted hash using the public key and verifying the hash against the message. If the 

■ ,s * signature is appended to the message, then the recipient of the message can verify both 

20 that the message is genuine and that it has not been altered in transit. 

To make public-key cryptography work, there has to be a way to distribute 
public keys which prevents impersonation. This is normally done using certificates and 
certificate authorities. A certificate authority is a trusted third party which authenticates 
the connection between a public key and someone's identity. The certificate authority 
25 verifies the person's identity by examining identity documents, and then creates and 
signs a digital certificate containing the person's identity details and public key. Anyone 
who trusts the certificate authority can use the public key in the certificate with a high 
degree of certainty that it is genuine. They just have to verify that the certificate has 
indeed been signed by the certificate authority, whose public key is well-known. 

30 In most transaction environments, public-key cryptography is only used to 

create digital signatures and to securely exchange secret session keys. Secret-key 
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cryptography is used for all other purposes. 

In the following discussion, when reference is made to the secure transmission 
of information between a netpage printer and a server, what actually happens is that the 
printer obtains the server's certificate, authenticates it with reference to the certificate 
5 authority, uses the public key-exchange key in the certificate to exchange a secret session 
key with the server, and then uses the secret session key to encrypt the message data. A 
session key, by definition, can have an arbitrarily short lifetime. 

3.2 Netpage Printer Security 

Each netpage printer is assigned a pair of unique identifiers at time of 

10 manufacture which are stored in read-only memory in the printer and in the netpage 
registration server database. The first ID 62 is public and uniquely identifies the printer 
on the netpage network. The second ID is secret and is used when the printer is first 
registered on the network. 

When the printer connects to the netpage network for the first time after 
1 5 installation, it creates a signature public/private key pair. It transmits the secret ED and 
the public key securely to the netpage registration server. The server compares the secret 
ID against the printer's secret ID recorded in its database, and accepts the registration if 
the IDs match. It then creates and signs a certificate containing the printer's public ID 
and public signature key, and stores the certificate in the registration database. 

20 The netpage registration server acts as a certificate authority for netpage 

printers, since it has access to secret information allowing it to verify printer identity. 

When a user subscribes to a publication, a record is created in the netpage 
registration server database authorizing the publisher to print the publication to the user's 
default printer or a specified printer. Every document sent to a printer via a page server is 
25 addressed to a particular user and is signed by the publisher using the publisher's private 
signature key. The page server verifies, via the registration database, that the publisher is 
authorized to deliver the publication to the specified user. The page server verifies the 
signature using the publisher's public key, obtained from the publisher's certificate 
stored in the registration database. 

30 The netpage registration server accepts requests to add printing authorizations 



NPS028US 



-58- 

to the database, so long as those requests are initiated via a pen registered to the printer. 

3.3 Netpage Pen Security 

Each netpage pen is assigned a unique identifier at time of manufacture which 

is stored in read-only memory in the pen and in the netpage registration server database. 
5 The pen ID 61 uniquely identifies the pen on the netpage network. 

A netpage pen can "know" a number of netpage printers, and a printer can 
"know" a number of pens. A pen communicates with a printer via a radio frequency 
signal whenever it is within range of the printer. Once a pen and printer are registered, 
they regularly exchange session keys. Whenever the pen transmits digital ink to the 
1 0 printer, the digital ink is always encrypted using the appropriate session key. Digital ink 
is never transmitted in the clear. 

A pen stores a session key for every printer it knows, indexed by printer ID, 
and a printer stores a session key for every pen it knows, indexed by pen ID. Both have a 
large but finite storage capacity for session keys, and will forget a session key on a least- 
15 recently-used basis if necessary. 

When a pen comes within range of a printer, the pen and printer discover 
whether they know each other. If they don't know each other, then the printer determines 
whether it is supposed to know the pen. This might be, for example, because the pen 
belongs to a user who is registered to use the printer. If the printer is meant to know the 
20 pen but doesn't, then it initiates the automatic pen registration procedure. If the printer 
isn't meant to know the pen, then it agrees with the pen to ignore it until the pen is 
placed in a charging cup, at which time it initiates the registration procedure. 

In addition to its public ID, the pen contains a secret key-exchange key. The 
key-exchange key is also recorded in the netpage registration server database at time of 

25 manufacture. During registration, the pen transmits its pen ID to the printer, and the 
printer transmits the pen ID to the netpage registration server. The server generates a 
session key for the printer and pen to use, and securely transmits the session key to the 
printer. It also transmits a copy of the session key encrypted with the pen's key-exchange 
key. The printer stores the session key internally, indexed by the pen ID, and transmits 

30 the encrypted session key to the pen. The pen stores the session key internally, indexed 
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by the printer ID. 

Although a fake pen can impersonate a pen in the pen registration protocol, 
only a real pen can decrypt the session key transmitted by the printer. 

When a previously unregistered pen is first registered, it is of limited use until 
5 it is linked to a user. A registered but "un-owned" pen is only allowed to be used to 
request and fill in netpage user and pen registration forms, to register a new user to 
which the new pen is automatically linked, or to add a new pen to an existing user. 

The pen uses secret-key rather than public-key encryption because of hardware 
performance constraints in the pen. 

10 3.4 Secure Documents 

The netpage system supports the delivery of secure documents such as tickets 

and coupons. The netpage printer includes a facility to print watermarks, but will only do 
so on request from publishers who are suitably authorized. The publisher indicates its 
authority to print watermarks in its certificate, which the printer is able to authenticate. 

1 5 The "watermark" printing process uses an alternative dither matrix in specified 

"watermark" regions of the page. Back-to-back pages contain mirror-image watermark 
regions which coincide when printed. The dither matrices used in odd and even pages' 
watermark regions are designed to produce an interference effect when the regions are 
viewed together, achieved by looking through the printed sheet. 

20 The effect is similar to a watermark in that it is not visible when looking at only 

one side of the page, and is lost when the page is copied by normal means. 

Pages of secure documents cannot be copied using the built-in netpage copy 
mechanism described in Section 1.9 above. This extends to copying netpages on 
netpage-aware photocopiers. 

25 Secure documents are typically generated as part of e-commerce transactions. 

They can therefore include the user's photograph which was captured when the user 
registered biometric information with the netpage registration server, as described in 
Section 2. 

When presented with a secure netpage document, the recipient can verify its 
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authenticity by requesting its status in the usual way. The unique ID of a secure 
document is only valid for the lifetime of the document, and secure document IDs are 
allocated non-contiguously to prevent their prediction by opportunistic forgers. A secure 
document verification pen can be developed with built-in feedback on verification 
5 failure, to support easy point-of-presentation document verification. 

Clearly neither the watermark nor the user's photograph are secure in a 
cryptographic sense. They simply provide a significant obstacle to casual forgery. Online 
document verification, particularly using a verification pen, provides an added level of 
security where it is needed, but is still not entirely immune to forgeries. 

1 0 3.5 Non-Repudiation 

In the netpage system, forms submitted by users are delivered reliably to forms 

handlers and are persistently archived on netpage page servers. It is therefore impossible 
for recipients to repudiate delivery. 

E-commerce payments made through the system, as described in Section 4, are 
1 5 also impossible for the payee to repudiate. 

4. Electronic Commerce Model 

4.1 Secure Electronic Transaction (SET) 

The netpage system uses the Secure Electronic Transaction (SET) system as 

one of its payment systems. SET, having been developed by MasterCard and Visa, is 
20 organized around payment cards, and this is reflected in the terminology. However, 
much of the system is independent of the type of accounts being used. Other payment 
systems may also be used. 

In SET, cardholders and merchants register with a certificate authority and are 
issued with certificates containing their public signature keys. The certificate authority 
25 verifies a cardholder's registration details with the card issuer as appropriate, and verifies 
a merchant's registration details with the acquirer as appropriate. Cardholders and 
merchants store their respective private signature keys securely on their computers. 
During the payment process, these certificates are used to mutually authenticate a 
merchant and cardholder, and to authenticate them both to the payment gateway. 
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SET has not yet been adopted widely, partly because cardholder maintenance of 
keys and certificates is considered burdensome. Interim solutions which maintain 
cardholder keys and certificates on a server and give the cardholder access via a 
password have met with some success. 



5 4.2 SET Payments 

In the netpage system the netpage registration server acts as a proxy for the 

netpage user (i.e. the cardholder) in SET payment transactions. 

The netpage system uses biometrics to authenticate the user and authorize SET 
payments. Because the system is pen-based, the biometric used is the user's on-line 
1 0 signature, consisting of time-varying pen position and pressure. A fingerprint biometric 
can also be used by designing a fingerprint sensor into the pen, although at a higher cost. 
The type of biometric used only affects the capture of the biometric, not the 
authorization aspects of the system. 

The first step to being able to make SET payments is to register the user's 
1 5 biometric with the netpage registration server. This is done in a controlled environment, 
for example a bank, where the biometric can be captured at the same time as the user's 
identity is verified. The biometric is captured and stored in the registration database, 
linked to the user's record. The user's photograph is also optionally captured and linked 
to the record. The SET cardholder registration process is completed, and the resulting 
20 private signature key and certificate are stored in the database. The user's payment card 
information is also stored, giving the netpage registration server enough information to 
act as the user's proxy in any SET payment transaction. 

When the user eventually supplies the biometric to complete a payment, for 
example by signing a netpage order form, the printer securely transmits the order 
25 information, the pen ID and the biometric data to the netpage registration server. The 
server verifies the biometric with respect to the user identified by the pen ID, and from 
then on acts as the user's proxy in completing the SET payment transaction. 

4.3 Micro-Payments 

The netpage system includes a mechanism for micro-payments, to allow the 

30 user to be conveniently charged for printing low-cost documents on demand and for 
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copying copyright documents, and possibly also to allow the user to be reimbursed for 
expenses incurred in printing advertising material. The latter depends on the level of 
subsidy already provided to the user. 

When the user registers for e-commerce, a network account is established 
5 which aggregates micro-payments. The user receives a statement on a regular basis, and 
can settle any outstanding debit balance using the standard payment mechanism. 

The network account can be extended to aggregate subscription fees for 
periodicals, which would also otherwise be presented to the user in the form of 
individual statements. 

10 4.4 Transactions 

When a user requests a netpage in a particular application context, the 

application is able to embed a user-specific transaction ID 55 in the page. Subsequent 
input through the page is tagged with the transaction ID, and the application is thereby 
able to establish an appropriate context for the user's input. 

15 When input occurs through a page which is not user-specific, however, the 

application must use the user's unique identity to establish a context. A typical example 
involves adding items from a pre-printed catalog page to the user's virtual "shopping 
cart". To protect the user's privacy, however, the unique user ID 60 known to the 
netpage system is preferably not divulged to applications. This is to prevent different 

20 application providers from easily correlating independently accumulated behavioral data. 

The netpage registration server instead maintains an anonymous relationship 
between a user and an application via a unique alias ID 65, as shown in Figure 24. 
Whenever the user activates a hyperlink tagged with the "registered" attribute, the 
netpage page server asks the netpage registration server to translate the associated 
25 application ID 64, together with the pen ID 61, into an alias ID 65. The alias ID is then 
submitted to the hyperlink's application. 

The application maintains state information indexed by alias ID, and is able to 
retrieve user-specific state information without knowledge of the global identity of the 
user. 

30 The system also maintains an independent certificate and private signature key 
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for each of a user's applications, to allow it to sign application transactions on behalf of 
the user using only application-specific information. 

To assist the system in routing product bar code (UPC) "hyperlink" activations, 
the system records a favorite application on behalf of the user for any number of product 
5 types. 

Each application is associated with an application provider, and the system 
maintains an account on behalf of each application provider, to allow it to credit and 
debit the provider for click-through fees etc. 

An application provider can be a publisher of periodical subscribed content. 
10 The system records the user's willingness to receive the subscribed publication, as well 
,H as the expected frequency of publication. 

^ 4.5 Resource Descriptions and Copyright 

m A preferred embodiment of a resource description class diagram is shown in 

Figure 40. 

i A 15 Each document and content object may be described by one or more resource 

descriptions 842. Resource descriptions preferably use the Dublin Core metadata 

in 

f;3 element set, which is designed to facilitate discovery of electronic resources. Dublin 

Core metadata conforms to the World Wide Web Consortium (W3C) Resource 
Description Framework (RDF). Other metadata element sets may be used. 

20 A resource description may identify rights holders 920. The netpage system 

automatically transfers copyright fees from users to rights holders when users print 
copyright content. 

5. Communications Protocols 

A communications protocol defines an ordered exchange of messages between 

25 entities. In the netpage system, entities such as pens, printers and servers utilize a set of 
defined protocols to cooperatively handle user interaction with the netpage system. 

Each protocol is illustrated by way of a sequence diagram in which the 
horizontal dimension is used to represent message flow and the vertical dimension is 
used to represent time. Each entity is represented by a rectangle containing the name of 
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the entity and a vertical column representing the lifeline of the entity. During the time an 
entity exists, the lifeline is shown as a dashed line. During the time an entity is active, 
the lifeline is shown as a double line. Because the protocols considered here do not 
create or destroy entities, lifelines are generally cut short as soon as an entity ceases to 
5 participate in a protocol. 

5.1 Subscription Delivery Protocol 

A preferred embodiment of a subscription delivery protocol is shown in Figure 

43. 

A large number of users may subscribe to a periodical publication. Each user's 
10 edition may be laid out differently, but many users' editions will share common content 
such as text objects and image objects. The subscription delivery protocol therefore 
delivers document structures to individual printers via pointcast, but delivers shared 
content objects via multicast. 

The application (i.e. publisher) first obtains a document ID 51 for each 
15 document from an ID server 12. It then sends each document structure, including its 
document ID and page descriptions, to the page server 10 responsible for the document's 
newly allocated ID. It includes its own application ID 64, the subscriber's alias ID 65, 
and the relevant set of multicast channel names. It signs the message using its private 
signature key. 

20 The page server uses the application ID and alias ID to obtain from the 

registration server the corresponding user ID 60, the user's selected printer ID 62 (which 
may be explicitly selected for the application, or may be the user's default printer), and 
the application's certificate. 

The application's certificate allows the page server to verify the message 
25 signature. The page server's request to the registration server fails if the application ID 
and alias ID don't together identify a subscription 808. 

The page server then allocates document and page instance IDs and forwards 
the page descriptions, including page IDs 50, to the printer. It includes the relevant set of, 
multicast channel names for the printer to listen to. 

30 It then returns the newly allocated page IDs to the application for future 
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reference. 

Once the application has distributed all of the document structures to the 
subscribers' selected printers via the relevant page servers, it multicasts the various 
subsets of the shared objects on the previously selected multicast channels. Both page 
5 servers and printers monitor the appropriate multicast channels and receive their required 
content objects. They are then able to populate the previously pointcast document 
structures. This allows the page servers to add complete documents to their databases, 
and it allows the printers to print the documents. 

5.2 Hyperlink Activation Protocol 

10 A preferred embodiment of a hyperlink activation protocol is shown in Figure 

45. 

When a user clicks on a netpage with a netpage pen, the pen communicates the 
click to the nearest netpage printer 601. The click identifies the page and a location on 
the page. The printer already knows the ID 61 of the pen from the pen connection 
15 protocol. 

The printer determines, via the DNS, the network address of the page server 
10a handling the particular page ID 50. The address may already be in its cache if the 
user has recently interacted with the same page. The printer then forwards the pen ID, its 
own printer ID 62, the page ID and click location to the page server. 

20 The page server loads the page description 5 identified by the page ID and 

determines which input element's zone 58, if any, the click lies in. Assuming the 
relevant input element is a hyperlink element 844, the page server then obtains the 
associated application ID 64 and link ID 54, and determines, via the DNS, the network 
address of the application server hosting the application 71. 

25 The page server uses the pen ID 61 to obtain the corresponding user ID 60 from 

the registration server 11, and then allocates a globally unique hyperlink request ID 52 
and builds a hyperlink request 934. The hyperlink request class diagram is shown in 
Figure 44. The hyperlink request records the IDs of the requesting user and printer, and 
identifies the clicked hyperlink instance 862. The page server then sends its own server 

30 ID 53, the hyperlink request ID, and the link ID to the application. 
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The application produces a response document according to application- 
specific logic, and obtains a document ID 51 from an ID server 12. It then sends the 
document to the page server 10b responsible for the document's newly allocated ID, 
together with the requesting page server's ID and the hyperlink request ID. 

5 The second page server sends the hyperlink request ID and application ID to 

the first page server to obtain the corresponding user ID and printer ID 62. The first page 
server rejects the request if the hyperlink request has expired or is for a different 
application. 

The second page server allocates document instance and page IDs 50, returns 
1 0 the newly allocated page IDs to the application, adds the complete document to its own 
database, and finally sends the page descriptions to the requesting printer. 

The hyperlink instance may include a meaningful transaction ID 55, in which 
case the first page server includes the transaction ID in the message sent to the 
application. This allows the application to establish a transaction-specific context for the 
1 5 hyperlink activation. 

If the hyperlink requires a user alias, i.e. its "alias required" attribute is set, then 
the first page server sends both the pen ID 61 and the hyperlink's application ID 64 to 
the registration server 1 1 to obtain not just the user ID corresponding to the pen ID but 
also the alias ID 65 corresponding to the application ID and the user ID. It includes the 
20 alias ID in the message sent to the application, allowing the application to establish a 
user-specific context for the hyperlink activation. 



5.3 Handwriting Recognition Protocol 

When a user draws a stroke on a netpage with a netpage pen, the pen 

communicates the stroke to the nearest netpage printer. The stroke identifies the page 
25 and a path on the page. 

The printer forwards the pen ID 61, its own printer ID 62, the page ID 50 and 
stroke path to the page server 10 in the usual way. 

The page server loads the page description 5 identified by the page ID and 
determines which input element's zone 58, if any, the stroke intersects. Assuming the 
30 relevant input element is a text field 878, the page server appends the stroke to the text 
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field's digital ink. 

After a period of inactivity in the zone of the text field, the page server sends 
the pen ID and the pending strokes to the registration server 1 1 for interpretation. The 
registration server identifies the user corresponding to the pen, and uses the user's 
5 accumulated handwriting model 822 to interpret the strokes as handwritten text. Once it 
has converted the strokes to text, the registration server returns the text to the requesting 
page server. The page server appends the text to the text value of the text field. 



5.4 Signature Verification Protocol 

Assuming the input element whose zone the stroke intersects is a signature 

I g 1 0 field 880, the page server 10 appends the stroke to the signature field's digital ink. 

4 After a period of inactivity in the zone of the signature field, the page server 

! - fi sends the pen ED 61 and the pending strokes to the registration server 1 1 for verification. 

It also sends the application ID 64 associated with the form of which the signature field 
9 is part, as well as the form ID 56 and the current data content of the form. The 

^ 15 registration server identifies the user corresponding to the pen, and uses the user's 

* dynamic signature biometric 818 to verify the strokes as the user's signature. Once it has 

1 verified the signature, the registration server uses the application ID 64 and user ID 60 to 

% ' identify the user's application-specific private signature key. It then uses the key to 

generate a digital signature of the form data, and returns the digital signature to the 
20 requesting page server. The page server assigns the digital signature to the signature field 
and sets the associated form's status to frozen. 

The digital signature includes the alias ID 65 of the corresponding user. This 
allows a single form to capture multiple users' signatures. 

5.5 Form Submission Protocol 

25 A preferred embodiment of a form submission protocol is shown in Figure 46. 

Form submission occurs via a form hyperlink activation. It thus follows the 
protocol defined in Section 5.2, with some form-specific additions. 

In the case of a form hyperlink, the hyperlink activation message sent by the 
page server 10 to the application 71 also contains the form ID 56 and the current data 
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content of the form. If the form contains any signature fields, then the application 
verifies each one by extracting the alias ID 65 associated with the corresponding digital 
signature and obtaining the corresponding certificate from the registration server 1 1 . 

5.6 Commission Payment Protocol 

5 A preferred embodiment of a commission payment protocol is shown in Figure 

47. 

In an e-commerce environment, fees and commissions may be payable from an 
application provider to a publisher on click-throughs, transactions and sales. 
Commissions on fees and commissions on commissions may also be payable from the 
^ 1 0 publisher to the provider of the printer. 

SJ The hyperlink request ID 52 is used to route a fee or commission credit from 

ijl the target application provider 70a (e.g. merchant) to the source application provider 70b 

\3 (i.e. publisher), and from the source application provider 70b to the printer provider 72. 

■ - The target application receives the hyperlink request ID from the page server 1 0 

15 when the hyperlink is first activated, as described in Section 5.2. When the target 
I y application needs to credit the source application provider, it sends the application 

provider credit to the original page server together with the hyperlink request ID. The 
□ page server uses the hyperlink request ID to identify the source application, and sends 

the credit on to the relevant registration server 1 1 together with the source application ID 
20 64, its own server ID 53, and the hyperlink request ID. The registration server credits the 
corresponding application provider's account 827. It also notifies the application 
provider. 

If the application provider needs to credit the printer provider, it sends the 
printer provider credit to the original page server together with the hyperlink request ID. 
25 The page server uses the hyperlink request ID to identify the printer, and sends the credit 
on to the relevant registration server together with the printer ID. The registration server 
credits the corresponding printer provider account 814. 

The source application provider is optionally notified of the identity of the 
target application provider, and the printer provider of the identity of the source 
30 application provider. 
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6. Netpage Pen Description 

6.1 Pen Mechanics 

Referring to Figures 8 and 9, the pen, generally designated by reference 

numeral 101, includes a housing 102 in the form of a plastics moulding having walls 103 
5 defining an interior space 104 for mounting the pen components. The pen top 105 is in . 
operation rotatably mounted at one end 106 of the housing 102. A semi-transparent cover 
107 is secured to the opposite end 108 of the housing 102. The cover 107 is also of 
moulded plastics, and is formed from semi-transparent material in order to enable the 
user to view the status of the LED mounted within the housing 102. The cover 107 
10 includes a main part 109 which substantially surrounds the end 108 of the housing 102 
and a projecting portion 110 which projects back from the main part 109 and fits within 
a corresponding slot 111 formed in the walls 103 of the housing 102. A radio antenna 

1 12 is mounted behind the projecting portion 110, within the housing 102. Screw threads 

113 surrounding an aperture 1 13A on the cover 107 are arranged to receive a metal end 
15 piece 114, including corresponding screw threads 115. The metal end piece 114 is 

removable to enable ink cartridge replacement. 

Also mounted within the cover 107 is a tri-color status LED 1 16 on a flex PCB 
117. The antenna 112 is also mounted on the flex PCB 117. The status LED 116 is 
mounted at the top of the pen 101 for good all-around visibility. 

20 The pen can operate both as a normal marking ink pen and as a non-marking 

stylus. An ink pen cartridge 118 with nib 119 and a stylus 120 with stylus nib 121 are 
mounted side by side within the housing 102. Either the ink cartridge nib 119 or the 
stylus nib 121 can be brought forward through open end 122 of the metal end piece 114, 
by rotation of the pen top 105. Respective slider blocks 123 and 124 are mounted to the 

25 ink cartridge 118 and stylus 120, respectively. A rotatable cam barrel 125 is secured to 
the pen top 105 in operation and arranged to rotate therewith. The cam barrel 125 
includes a cam 126 in the form of a slot within the walls 181 of the cam barrel. Cam 
followers 127 and 128 projecting from slider blocks 123 and 124 fit within the cam slot 
126. On rotation of the cam barrel 125, the slider blocks 123 or 124 move relative to 

30 each other to project either the pen nib 1 19 or stylus nib 121 out through the hole 122 in 
the metal end piece 1 14. The pen 101 has three states of operation. By turning the top 
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105 through 90° steps, the three states are: 
Stylus 120 nib 121 out; 

• Ink cartridge 118 nib 119 out; and 

• Neither ink cartridge 118 nib 119 out nor stylus 120 nib 121 out. 

5 A second flex PCB 129, is mounted on an electronics chassis 130 which sits 

within the housing 102. The second flex PCB 129 mounts an infrared LED 131 for 
providing infrared radiation for projection onto the surface. An image sensor 132 is 
provided mounted on the second flex PCB 129 for receiving reflected radiation from the 
surface. The second flex PCB 129 also mounts a radio frequency chip 133, which 

10 includes an RF transmitter and RF receiver, and a controller chip 134 for controlling 
operation of the pen. An optics block 135 (formed from moulded clear plastics) sits 
within the cover 107 and projects an infrared beam onto the surface and receives images 
onto the image sensor 132. Power supply wires 136 connect the components on the 
second flex PCB 129 to battery contacts 137 which are mounted within the cam barrel 

15 125. A terminal 138 connects to the battery contacts 137 and the cam barrel 125. A three 
volt rechargeable battery 139 sits within the cam barrel 125 in contact with the battery 
contacts. An induction charging coil 140 is mounted about the second flex PCB 129 to 
enable recharging of the battery 139 via induction. The second flex PCB 129 also 
mounts an infrared LED 143 and infrared photodiode 144 for detecting displacement in 

20 the cam barrel 125 when either the stylus 120 or the ink cartridge 1 1 8 is used for writing, 
in order to enable a determination of the force being applied to the surface by the pen nib 
119 or stylus nib 121. The IR photodiode 144 detects light from the IR LED 143 via 
reflectors (not shown) mounted on the slider blocks 123 and 124. 

Rubber grip pads 141 and 142 are provided towards the end 108 of the housing 
25 102 to assist gripping the pen, and top 105 also includes a clip 142 for clipping the pen 
to a pocket. 

6.2 Pen Controller 

The pen 101 is arranged to determine the position of its nib (stylus nib 121 or 

ink cartridge nib 1 19) by imaging, in the infrared spectrum, an area of the surface in the 
30 vicinity of the nib. It records the location data from the nearest location tag, and is 
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arranged to calculate the distance of the nib 121 or 119 from the location tab utilising 
optics 135 and controller chip 134. The controller chip 134 calculates the orientation of 
the pen and the nib-to-tag distance from the perspective distortion observed on the 
imaged tag. 

5 Control data from the location tag may include control bits instructing the pen 

to activate its "active area" LED (this is in fact one mode of the tri-color LED 116, 
which becomes yellow when the pen determines, from the control data, that the area that 
is being imaged is an "active area"). Thus, a region on the surface which corresponds to 
the active area of a button or hyperlink may be encoded to activate this LED, giving the 
10 user of the pen visual feedback that the button or hyperlink is active when the pen 101 
passes over it. Control data may also instruct the pen to capture continuous pen force 
readings. Thus a region on the surface which corresponds to a signature input area can be 
encoded to capture continuous pen forcie. 

Pen action relative to the surface may comprise a series of strokes. A stroke 
1 5 consists of a sequence of time-stamped pen positions on the surface, initiated by pen- 
down event and completed by a subsequent pen-up event. Note that pen force can be 
interpreted relative to a threshold to indicate whether the pen is "up" or "down", as well 
as being interpreted as a continuous value, for example when the pen is capturing a 
signature. The sequence of captured strokes constitutes so-called "digital ink". Digital 
20 ink can be used with a computing system to form the basis for the digital exchange of 
drawings and handwriting, for on-line recognition of handwriting, and for on-line 
verification of signatures. 

Utilizing the RF chip 133 and antenna 1 12 the pen 101 can transmit the digital 
ink data (which is encrypted for security and packaged for efficient transmission) to the 
25 computing system. 

When the pen is in range of a receiver, the digital ink data is transmitted as it is 
formed. When the pen moves out of range, digital ink data is buffered within the pen (the 
pen circuitry includes a buffer arranged to store digital ink data for approximately 12 
minutes of the pen motion on the surface) and can be transmitted later. 

30 The controller chip 134 is mounted on the second flex PCB 129 in the pen. 

Figure 10 is a block diagram illustrating in more detail the architecture of the controller 
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chip 134. Figure 10 also shows representations of the RF chip 133, the image sensor 132, 
the tri-color status LED 116, the IR illumination LED 131, the IR force sensor LED 143, 
and the force sensor photodiode 144. 

The pen controller chip 134 includes a controlling processor 145. Bus 146 
5 enables the exchange of data between components of the controller chip 134. Flash 
memory 147 and a 512 KB DRAM 148 are also included. An analog-to-digital converter 
149 is arranged to convert the analog signal from the force sensor photodiode 144 to a 
digital signal. 

An image sensor interface 152 interfaces with the image sensor 132. A 
10 transceiver controller 153 and base band circuit 154 are also included to interface with 
the RF chip 133 which includes an RF circuit 155 and RF resonators and inductors 156 
connected to the antenna 112. 

The controlling processor 145 captures and decodes location data from tags 
from the surface via the image sensor 132, monitors the force sensor photodiode 144, 
15 controls the LEDs 116, 131 and 143, and handles short-range radio communication via 
the radio transceiver 153. It is a medium-performance (~40MHz) general-purpose RISC 
processor. 

The processor 145, digital transceiver components (transceiver controller 153 
and baseband circuit 154), image sensor interface 152, flash memory 147 and 512KB 
20 DRAM 148 are integrated in a single controller ASIC. Analog RF components (RF 
circuit 155 and RF resonators and inductors 156) are provided in the separate RF chip. 

The image sensor is a CCD of with an approximate resolution of 215x215 
pixels (such a sensor is produced by Matsushita Electronic Corporation, and is described 
in a paper by Itakura, K T Nobusada, N Okusenya, R Nagayoshi, and M Ozaki, "A 1mm 
25 50k-Pixel IT CCD Image Sensor for Miniature Camera System", IEEE Transactions on 
Electronic Devices, Volt 47, number 1, January 2000, which is incorporated herein by 
reference) with an IR filter. 

The controller ASIC 134 enters a quiescent state after a period of inactivity^ 
when the pen 101 is not in contact with a surface. It incorporates a dedicated circuit 150 
30 which monitors the force sensor photodiode 144 and wakes up the controller 134 via the 
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power manager 151 on a pen-down event. 

The radio transceiver communicates in the unlicensed 900MHz band normally 
used by cordless telephones, or alternatively in the unlicensed 2.4GHz industrial, 
scientific and medical (ISM) band, and uses frequency hopping and collision detection to 
5 provide interference-free communication. 



6.3 Pen Optics 

As discussed above, the pen optics are implemented by a molded optics body 

135. The optics implemented by the optics body 135 are illustrated schematically in 
Figure 67. The optics comprise a first lens 157 for focusing radiation from the infrared 
10 LED 131, a mirror 158, a beam splitter 159, an objective lens 160 and a second lens 161 
for focusing an image onto image sensor 132. Axial rays 162 illustrate the optical path. 



The optical path is designed to deliver a sharp image to the image sensor 132 of 
that part 193 of the imaged surface which intersects the field of view cone 192, within 
required tilt ranges (see later). The primary focusing element is the objective lens 160. 

15 This is also used in reverse to project illumination from the IR illumination LED 131 
onto the surface within the field of view. Since it is impractical to place both the image 
sensor 132 and the IR LED 131 at the focus of the objective, a beam splitter 159 is used 
to split the path and separate relay lenses 157 and 161 in each path provides refocusing at 
the image sensor 132 and the IR LED 131 respectively. This also allows different 

20 apertures to be imposed on the two paths. 

The edges of the image sensor 132 act as the field stop for the capture field, and 
the capture path is designed so that the resulting object space angular field of view is as 
required (i.e. just under 20° for the application of this embodiment). The illumination 
path is designed to produce the same object space field of view as the capture path, so 
25 that the illumination fills the object space field of view with maximum power and 
uniformity. 

The IR LED 131 is strobed in synchrony with frame capture. The use of 
focussed illumination allows both a short exposure time and a small aperture. The short 
exposure time prevents motion blur, thus allowing position tag data capture during pen 
30 movement. The small aperture allows sufficient depth of field for the full range of 
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surface depths induced by tilt. The capture path includes an explicit aperture stop 191 for 
this purpose. 

Because the image sensor 132 has a strong response throughout the visible and 
near infrared part of the spectrum, it is preceded by an infrared filter 163 in the capture 
5 path so that it captures a clean image of the tag data on the surface, free from 
interference from other graphics on the surface which may be printed using inks which 
are transparent in the near infrared. 

6.4 Pen Processing 

When the stylus nib 121 or ink cartridge nib 1 19 of the pen is in contact with a 

10 surface, the pen determines its position and orientation relative to the surface at 100 Hz 
to allow accurate handwriting recognition (see the article by Tappert, C,CY Suen and T 
Wakahara, "The State of the Art in On-Line Hand Writing Recognition" IEEE 
Transactions on Patent Analysis and Machine Intelligence, Vol 12, number 8, August 
1990, the disclosure of which is incorporated herein by cross-reference). The force 

1 5 sensor photodiode 144 is utilised to indicate relative threshold whether the pen is "up" or 
"down". The force may also be captured as a continuous value, as discussed above, to 
allow the full dynamics of a signature to be verified. 

The pen determines the position and orientation of its nib 119, 121 on the 
surface by imaging, in the infrared spectrum, an area of the surface in the vicinity of the 

20 nib 119, 121. It decodes the nearest tag data and computes the position of the nib 119, 
121 relative to the location tag from the observed perspective distortion on the imaged 
tag and the known geometry of the pen optics 135 (see later). Although the position 
resolution of the tag may be low, the adjusted position resolution is quite high, and easily 
exceeds the 200 dpi resolution required for accurate handwriting recognition (see above 

25 reference). 

Pen actions relative to a surface are captured as a series of strokes. A stroke 
consists of a sequence of time-stamped pen positions on the surface, initiated by a pen- 
down event and completed by the subsequent pen-up event. A stroke is also tagged with 
the region ID of the surface whenever the region ID changes, i.e. just at the start of the 
30 stroke under normal circumstances. As discussed above, each location tag includes data 
indicative of its position on the surface and also region data indicative of the region of 
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the surface within which the tag lies. 

Figure 68 is a diagram illustrating location tag and stroke processing in the pen. 
The pen controller 134 starts with an empty stroke (at 164). It then continuously samples 
(at 165) the nib force via the force sensor photodiode 144, checking for a pen-down state 
5 (at 166). If the pen is in a pen-down state, the pen controller 134 captures (at 167) an 
image of the surface, locates (at 168) a tag in the captured image, decodes (at 169) tag 
data from the tag, infers (at 170) the pen position and orientation relative to the surface, 
and appends (at 171) the position data to the current stroke data. On detection of a pen- 
up event, i.e. on detection (at 1 72) of a pen-up state after a pen-down state as indicated 
10 by the presence of a non-empty stroke, the pen controller 134 encrypts (at 173) the stroke 
data and transmits (at 174) the stroke data via the RF chip 133 and antenna 112, to the 
computing system. It then starts with another empty stroke (at 164). 

Assuming a reasonably fast 8 bit multiply (3 cycles), the processing algorithm 
(see later) uses about 80% of the processor's time when the pen is active. 

15 If the pen is out of range of a computing system to transmit to, then it buffers 

digital ink in its internal memory. It transmits any buffered digital ink when it is next 
within range of a computing system. When the pen's internal memory is full the pen 
ceases to capture digital ink and instead flashes its error LED whenever the user attempts 
to write with the pen. 

20 Table 4 lists the components of the raw digital ink transmitted from the pen to 

the computing system. Figure 69 is a diagram illustrating the structure of the raw digital 
ink. Digital ink which is buffered in the pen when the pen is working offline is stored in 
the same form as digital ink which is transmitted to the system. 

25 Table 4. Raw digital ink components 



raw digital ink 
component 


unit 


precision 
(bits) 


range 


pen ID 




128 




nib ID 




128 




absolute time 


ms 


64 




last system time 


ms 


64 
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region ID 


- 


100 


- 


time offset 


ms 


32 


49.7 days 


tag ID 


- 


16 


- 


x offset 


20 ixm 


S9 


±10mm 


y offset 


20 (am 


S9 


±10mm 


x rotation (pitch) 


degree 


S7 


±90° 


y rotation (roll) 


degree 


S7 


±90° 


z rotation (yaw) 


degree 


S7 


360° 


z force 




8 


255 



When the pen 101 connects to the computing system, the controller 134 
notifies the system, in the form of a raw digital ink header 182, of the pen ID, nib ID, 
>3 current absolute time, and the last absolute time it obtained from the system prior to 

ry 5 going offline. This allows the system to compute any drift in the pen's clock and 

lr timeshift any digital ink received from the pen accordingly. The pen then synchronizes 

W its real-time clock with the accurate real-time clock of the system. The pen ID allows the 

computing system to identify the pen when there is more than one pen being operated 
I "2 with the computing system. Pen IDs may be important in systems which use the pen to 

l y 10 identify an owner of the pen, for example, and interact with that owner in a particular 

p directed manner. In other embodiments this may not be required. The nib ID allows the 

computing system to identify which nib, stylus nib 121 or ink cartridge nib 119, is 
presently being used. The computing system can vary its operation depending upon 
which nib is being used. For example, if the ink cartridge nib 119 is being used the 
1 5 computing system may defer producing feedback output because immediate feedback is 
provided by the ink markings made on the surface. Where the stylus nib 121 is being 
used, the computing system may produce immediate feedback output. 

At the start of a stroke the pen controller 134 records, in a raw stroke header 
183, the elapsed time since the last absolute time notified to the system. For each pen 
20 position in the stroke the controller 134 records, in the form of a raw pen position 177, 
the x and y offset of the pen nib 119, 121 from the current tag, the x, y and z rotation of 
the pen, and the nib force. It only records the tag ID (which identifies a tag within a 
region) if it has changed, in the form of a tag change 178. Since the tag frequency is 
significantly smaller than the typical position sampling frequency, the tag ID is constant 
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for many consecutive pen positions, and may be constant for the entire stroke if the 
stroke is short. 

Since the pen samples its positions and orientation at 100 Hz, pen positions in a 
stroke are implicitly clocked at 100 Hz and do not need an explicit timestamp. If the pen 
5 fails to compute a pen position, e.g. because it fails to decode a tag, it must still record a 
pen position to preserve the implicit clocking. It therefore records the position as 
unknown, in the form of an unknown pen position 179, allowing the computing system 
to later interpolate the position from adjacent samples if necessary. 

Since the 32-bit time offset of a stroke has a finite range (i.e. 49.7 days), the 
10 pen optionally records an absolute time, in the form of a time change 176, for a stroke. 
This becomes the absolute time relative to which later strokes' time offsets are 
measured. 

Since the region ID is constant for many consecutive strokes, the pen only 
records the region ID when it change, in the form of a region change 1 80. This becomes 
1 5 the region ID implicitly associated with later pen positions. 

Since a user may change the nib 119, 121 between one stroke and the next, the 
pen optionally records a nib ID for a stroke, in the form of a nib change 175. This 
becomes the nib ID implicitly associated with later strokes. 

Each component of a stroke has an entropy-coded prefix, as listed in Table 5. 

20 

Table 5. Raw stroke component prefixes 



raw stroke components 


prefix 


raw pen position 


0 


unknown pen position 


10 


tag change 


1100 


end of stroke 


1101 


region change 


11100 


nib change 


11101 


time change 


11110 



A 10mm stroke of 1 second duration spans two or three tags, contains 100 
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positions samples, and therefore has a size of about 5500 bits. Online continuous digital 
ink capture therefore requires a maximum transmission speed of 5.5 Kbps, and offline 
continuous digital ink capture requires about 40 Kbytes of buffer memory per minute. 
The pen's 512KB DRAM 48 can therefore hold over 12 minutes of continuous digital 
5 ink. Time, region and nib changes happen so infrequently that they have a negligible 
effect on the required transmission speed and buffer memory. Additional compression of 
pen positions can reduce transmission speed and buffer memory requirements further. 

Each raw stroke is encrypted using the Triple-DES algorithm (Schneier, B, 
Applied Cryptography, Second Edition, Wiley 1996, the disclosure of which is 
10 incorporated herein by cross-reference) before being transmitted to the computing 
O system. The pen and computing system exchange session keys for this purpose on a 

sj regular basis. Based on a conservative estimate of 50 cycles per encrypted bit, the 

fn encryption of a one-second 5500 bit stroke consumes 0.7% of the processor's 45 time. 



6.5 Other Pen Embodiments 

15 In an alternative embodiment, the pen incorporates an Infrared Data 

Association (IrDA) interface for short-range communication with a base station or 
netpage printer. 

In a further embodiment, the pen 101 includes a pair of orthogonal 
accelerometers mounted in the normal plane of the pen axis. The accelerometers 190 are 
20 shown in Figures 9 and 10 in ghost outline. 

The provision of the accelerometers enables this embodiment of the pen to 
sense motion without reference to surface location tags, allowing the location tags to be 
sampled at a lower rate. Each location tag ID can then identify an object of interest rather 
than a position on the surface. For example, if the object is a user interface input element 
25 (e.g. a command button), then the tag ID of each location tag within the area of the input 
element can directly identify the input element. 

The acceleration measured by the accelerometers in each of the x and y 
directions is integrated with respect to time to produce an instantaneous velocity and 
position. 

30 Since the starting position of the stroke is not known, only relative positions 
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within a stroke are calculated. Although position integration accumulates errors in the 
sensed acceleration, accelerometers typically have high resolution, and the time duration 
of a stroke, over which errors accumulate, is short. 



7 Netpage Printer Description 

5 7.1 Printer Mechanics 

The vertically-mounted netpage wallprinter 601 is shown fully assembled in 

Figure 11. It prints netpages on Letter/A4 sized media using duplexed 8V2" Memjet™ 
print engines 602 and 603, as shown in Figures 12 and 12a. It uses a straight paper path 
with the paper 604 passing through the duplexed print engines 602 and 603 which print 
1 0 both sides of a sheet simultaneously, in full color and with full bleed. 

An integral binding assembly 605 applies a strip of glue along one edge of each 
printed sheet, allowing it to adhere to the previous sheet when pressed against it. This 
creates a final bound document 618 which can range in thickness from one sheet to 
several hundred sheets. 

15 The replaceable ink cartridge 627, shown in Figure 13 coupled with the 

duplexed print engines, has bladders or chambers for storing fixative, adhesive, and 
cyan, magenta, yellow, black and infrared inks. The cartridge also contains a micro air 
filter in a base molding. The micro air filter interfaces with an air pump 638 inside the 
printer via a hose 639. This provides filtered air to the printheads to prevent ingress of 

20 micro particles into the Memjet™ printheads 350 which might otherwise clog the 
printhead nozzles. By incorporating the air filter within the cartridge, the operational life 
of the filter is effectively linked to the life of the cartridge. The ink cartridge is a fully 
recyclable product with a capacity for printing and gluing 3000 pages (1 500 sheets). 

Referring to Figure 12, the motorized media pick-up roller assembly 626 
25 pushes the top sheet directly from the media tray past a paper sensor on the first print 
engine 602 into the duplexed Memjet™ printhead assembly. The two Memjet™ print 
engines 602 and 603 are mounted in an opposing in-line sequential configuration along 
the straight paper path. The paper 604 is drawn into the first print engine 602 by integral, 
powered pick-up rollers 626. The position and size of the paper 604 is sensed and full 
30 bleed printing commences. Fixative is printed simultaneously to aid drying in the 
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shortest possible time. 

The paper exits the first Memjet™ print engine 602 through a set of powered 
exit spike wheels (aligned along the straight paper path), which act against a rubberized 
roller. These spike wheels contact the 'wet' printed surface and continue to feed the 
sheet 604 into the second Memjet™ print engine 603. 

Referring to Figures 12 and 12a, the paper 604 passes from the duplexed print 
engines 602 and 603 into the binder assembly 605. The printed page passes between a 
powered spike wheel axle 670 with a fibrous support roller and another movable axle 
with spike wheels and a momentary action glue wheel. The movable axle/glue assembly 
673 is mounted to a metal support bracket and it is transported forward to interface with 
the powered axle 670 via gears by action of a camshaft. A separate motor powers this 
camshaft. 

The glue wheel assembly 673 consists of a partially hollow axle 679 with a 
rotating coupling for the glue supply hose 641 from the ink cartridge 627. This axle 679 
connects to a glue wheel, which absorbs adhesive by capillary action through radial 
holes. A molded housing 682 surrounds the glue wheel, with an opening at the front. 
Pivoting side moldings and sprung outer doors are attached to the metal bracket and 
hinge out sideways when the rest of the assembly 673 is thrust forward. This action 
exposes the glue wheel through the front of the molded housing 682. Tension springs 
close the assembly and effectively cap the glue wheel during periods of inactivity. 

As the sheet 604 passes into the glue wheel assembly 673, adhesive is applied 
to one vertical edge on the front side (apart from the first sheet of a document) as it is 
transported down into the binding assembly 605. 

7.2 Printer Controller Architecture 

The netpage printer controller consists of a controlling processor 750, a factory- 
installed or field-installed network interface module 625, a radio transceiver (transceiver 
controller 753, baseband circuit 754, RF circuit 755, and RF resonators and inductors 
756), dual raster image processor (RIP) DSPs 757, duplexed print engine controllers 
760a and 760b, flash memory 658, and 64MB of DRAM 657, as illustrated in Figure 14. 



The controlling processor handles communication with the network 19 and 
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with local wireless netpage pens 101, senses the help button 617, controls the user 
interface LEDs 613-616, and feeds and synchronizes the RIP DSPs 757 and print engine 
controllers 760. It consists of a medium-performance general-purpose microprocessor. 
The controlling processor 750 communicates with the print engine controllers 760 via a 
5 high-speed serial bus 659. 

The RIP DSPs rasterize and compress page descriptions to the netpage printer's 
compressed page format. Each print engine controller expands, dithers and prints page 
images to its associated Memjet™ printhead 350 in real time (i.e. at over 30 pages per 
minute). The duplexed print engine controllers print both sides of a sheet simultaneously. 

10 The master print engine controller 760a controls the paper transport and 

I '5 monitors ink usage in conjunction with the master QA chip 665 and the ink cartridge QA 

,-/t chip 761. 

^5 The printer controller's flash memory 658 holds the software for both the 

w processor 750 and the DSPs 757, as well as configuration data. This is copied to main 

15 memory 657 at boot time. 

m The processor 750, DSPs 757, and digital transceiver components (transceiver 

I S controller 753 and baseband circuit 754) are integrated in a single controller ASIC 656. 

•:f Analog RF components (RF circuit 755 and RF resonators and inductors 756) are 

provided in a separate RF chip 762. The network interface module 625 is separate, since 
20 netpage printers allow the network connection to be factory-selected or field-selected. 

Flash memory 658 and the 2x256Mbit (64MB) DRAM 657 is also off-chip. The print 

engine controllers 760 are provided in separate ASICs. 

A variety of network interface modules 625 are provided, each providing a 
netpage network interface 751 and optionally a local computer or network interface 752. 

25 Netpage network Internet interfaces include POTS modems, Hybrid Fiber-Coax (HFC) 
cable modems, ISDN modems, DSL modems, satellite transceivers, current and next- 
generation cellular telephone transceivers, and wireless local loop (WLL) transceivers. 
Local interfaces include IEEE 1284 (parallel port), lOBase-T and 100Base-T Ethernet, 
USB and USB 2.0, IEEE 1394 (Firewire), and various emerging home networking 

30 interfaces. If an Internet connection is available on the local network, then the local 
network interface can be used as the netpage network interface. 



NPS028US 




-82- 

The radio transceiver 753 communicates in the unlicensed 900MHz band 
normally used by cordless telephones, or alternatively in the unlicensed 2.4GHz 
industrial, scientific and medical (ISM) band, and uses frequency hopping and collision 
detection to provide interference-free communication. 

5 The printer controller optionally incorporates an Infrared Data Association 

(IrDA) interface for receiving data "squirted" from devices such as netpage cameras. In 
an alternative embodiment, the printer uses the IrDA interface for short-range 
communication with suitably configured netpage pens. 

7-2.1 Rasterization and Printing 

. 10 Once the main processor 750 has received and verified the document's page 

;*3 layouts and page objects, it runs the appropriate RIP software on the DSPs 757. 

J ~ The DSPs 757 rasterize each page description and compress the rasterized page 

0 image. The main processor stores each compressed page image in memory. The simplest 
■i:q way to load-balance multiple DSPs is to let each DSP rasterize a separate page. The 
~\ 15 DSPs can always be kept busy since an arbitrary number of rasterized pages can, in 

H general, be stored in memory. This strategy only leads to potentially poor DSP utilization 

1 y 

Hjl when rasterizing short documents. 

O Watermark regions in the page description are rasterized to a contone- 

resolution bi-level bitmap which is losslessly compressed to negligible size and which 
20 forms part of the compressed page image. 

The infrared (IR) layer of the printed page contains coded netpage tags at a 
density of about six per inch. Each tag encodes the page ID, tag ID, and control bits, and 
the data content of each tag is generated during rasterization and stored in the 
compressed page image. 

25 The main processor 750 passes back-to-back page images to the duplexed print 

engine controllers 760. Each print engine controller 760 stores the compressed page 
image in its local memory, and starts the page expansion and printing pipeline. Page 
expansion and printing is pipelined because it is impractical to store an entire 1 14MB bi- 
level CM YK+IR page image in memory. 
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7.2.2 Print Engine Controller 

The page expansion and printing pipeline of the print engine controller 760 

consists of a high speed IEEE 1394 serial interface 659, a standard JPEG decoder 763, a 
standard Group 4 Fax decoder 764, a custom halftoner/compositor unit 765, a custom tag 
5 encoder 766, a line loader/formatter unit 767, and a custom interface 768 to the 
Memjet™ printhead 350. 

The print engine controller 360 operates in a double buffered manner. While 
one page is loaded into DRAM 769 via the high speed serial interface 659, the 
previously loaded page is read from DRAM 769 and passed through the print engine 
1 0 controller pipeline. Once the page has finished printing, the page just loaded is printed 

0 while another page is loaded. 

^ The first stage of the pipeline expands (at 763) the JPEG-compressed contone 

1 y CMYK layer, expands (at 764) the Group 4 Fax-compressed bi-level black layer, and 
1 J renders (at 766) the bi-level netpage tag layer according to the tag format defined in 
^ 8 15 section 1.2, all in parallel. The second stage dithers (at 765) the contone CMYK layer 
I ^ and composites (at 765) the bi-level black layer over the resulting bi-level CMYK layer. 

The resultant bi-level CMYK+IR dot data is buffered and formatted (at 767) for printing 
5 iJ on the Memjet™ printhead 350 via a set of line buffers. Most of these line buffers are 

O stored in the off-chip DRAM. The final stage prints the six channels of bi-level dot data 

20 (including fixative) to the Memjet™ printhead 350 via the printhead interface 768. 

When several print engine controllers 760 are used in unison, such as in a 
duplexed configuration, they are synchronized via a shared line sync signal 770. Only 
one print engine 760, selected via the external master/slave pin 771, generates the line 
sync signal 770 onto the shared line. 

25 The print engine controller 760 contains a low-speed processor 772 for 

synchronizing the page expansion and rendering pipeline, configuring the printhead 350 
via a low-speed serial bus 773, and controlling the stepper motors 675, 676. 

In the versions of the netpage printer, the two print engines each prints 30 
Letter pages per minute along the long dimension of the page (1 1"), giving a line rate of 
30 8.8 kHz at 1600 dpi. In the 12" versions of the netpage printer, the two print engines 
each prints 45 Letter pages per minute along the short dimension of the page (8i/2"), 



NPS028US 




-84- 

giving a line rate of 10.2 kHz. These line rates are well within the operating frequency of 
the Memjet™ printhead, which in the current design exceeds 30 kHz. 

8. Netpage Tags 
8.1 Tag Tiling 

5 8.1 .1 Planar Surface Tag Tiling 

In order to support "single-click" interaction with a tagged region via a sensing 

device, the sensing device must be able to see at least one entire tag 4 in its field of view 
no matter where in the region or at what orientation it is positioned. The required 
diameter of the field of view of the sensing device is therefore a function of the size and 
1 0 spacing of the tags 4. 

In the case where the tag shape is circular, such as the preferred tag 4 described 
earlier, the minimum diameter m of the sensor field of view is obtained when the tags 
500, of diameter k, are tiled on an equilateral triangular grid, as shown in Figure 52 and 
defined in EQ 1. This is achieved when the center-to-center tag spacing is the same as 
1 5 the tag diameter k. 

With a tag diameter k of 256 dots (-4 mm at 1600 dpi), m is therefore 552 dots 
(-8.8 mm). With a quiet area of 16 dots, i.e. an effective tag diameter k of 272 dots (-4.3 
mm), m increases to 587 dots (—9.3 mm). 

When the tags 4 are moved a distance s apart, where s is at least as large as k, 
20 then the minimum field of view is given by EQ 2. 

When no overlap is desired in the horizontal direction between successive lines 
of tags 500, for example to make tag rendering easier, the tags must be moved apart by a 
minimum amount given by EQ 3. For a 256-dot diameter tag, u is therefore 40 dots 
(-0.6 mm at 1600 dpi). Since this exceeds the quiet area required for the tag, the quiet 
25 area can be ignored if tag lines are rendered to not overlap. 

Setting s = k + u in EQ 2 gives EQ 4. For a 256-dot diameter tag, s is therefore 
296 dots (-4.7 mm at 1600 dpi), and m is 598 dots (-9.5 mm). 

8.1.2 Spherical Surface Tag Tiling 

A regular icosahedron is often used as the basis for generating an almost 
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regular triangular tiling of a sphere. A regular icosahedron, such as icosahedron 526 in 
Figure 53, is composed of twenty equal-sized equilateral triangular faces 528 sharing 
thirty edges 530 and twelve vertices 532, with five of the edges 530 meeting at each of 
the vertices 532. 

5 To achieve the required tiling, the icosahedron 526 is inscribed in a target 

sphere, and each triangle 528 of the icosahedron 526 is subdivided into an equal number 
of equal-sized equilateral subdivision triangles to yield the desired total number of 
triangles. If each edge 530 of the icosahedron is divided into v equal intervals, defining a 
set of v— 1 points along each edge, and each pair of corresponding points along any two 

10 adjacent edges is joined by a line parallel to the other shared adjacent edge, the lines so 
drawn intersect at the vertices of the desired equal-sized and equilateral subdivision 
triangles, resulting in the creation of v 2 triangles per triangular face 528 of the 
icosahedron 526, or 20v 2 triangles in all. Of the resulting 10v 2 + 2 vertices, five 
triangular faces meet at each of the twelve original vertices of the icosahedron 526, and 

15 six triangular faces meet at the each of the remaining vertices. The twelve original 
vertices 532 already lie on the sphere, while the remaining vertices lie inside the sphere. 
Each created vertex is therefore centrally projected onto the sphere, giving the desired 
tiling. 

A sphere approximated by a regular polyhedron in this way is referred to as a 
20 geodesic, and the parameter v is referred to as the frequency of the geodesic. Figure 54 
shows an icosahedral geodesic 534 with v=3, i.e. with 180 faces 528. 

The closer a subdivision triangle is to the center of a face of the icosahedron 
526, the further it is from the surface of the sphere, and hence the larger it is when 
projected onto the sphere. To minimise variation in the size of projected subdivision 

25 triangles, subdivison vertices can systematically be displaced prior to projection 
(Tegmark, M., "An Icosahedron-Based Method for Pixelizing the Celestial Sphere", ApJ 
Letters, 470, L81, October 14, 1996). If v = 1 then no vertices are created and the angle 
subtended by a triangular face at a vertex remains 60°. As v increases, however, the 
surface defined by the five triangular faces surrounding each original vertex becomes 

30 increasingly flat, and the vertex angle of each triangular face converges on 72° (i.e. 360° 
/ 5). This defines the worst case for a tag tiling of a sphere. In a 72° isosceles triangle the 
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base length is 1.18 times the length of the two sides. The maximum tag spacing s for the 
purposes of calculating the sensor field of view is therefore close to 1 . 1 Sk. With a tag 
diameter of 256 dots and a quiet area of 16 dots, i.e. an effective tag diameter k of 272 
dots (—4.3 mm), m is therefore 643 dots (—10.2 mm) according to EQ 2. 

5 The angle subtended by each edge of an icosahedron at the center of the 

circumscribing sphere is given by EQ 5 

For a sphere of radius r the arc length of each centrally projected edge is r9. 
Given a tag diameter of K in the same units as r, the number of tags n required to cover 
the sphere is given by EQ 6. 

1 0 For a given w, r is limited by EQ 7. 

If n is limited to 2 16 , to allow the use of a 16-bit tag ID without requiring 
multiple regions to cover the sphere, and K is taken to be 4.3 mm as above, then r is 
limited to -3 1 0 mm. 

A typical globe has a radius of 160 mm. Its projected arc length of -177 mm 
15 fits 41 evenly spaced tags with negligible additional spacing. Such a globe uses 16812 
tags in total. 

8.1.3 Arbitrary Curved Surface Tag Tiling 

A triangle mesh can approximate a surface of arbitrary topography and 

topology without introducing discontinuities or singularities, with the local scale of the 
20 mesh being dictated by the local curvature of the surface and an error bound. Assuming 
the existence of a triangle mesh for a particular surface, an effective non-regular tiling of 
tags can be produced as long as each mesh triangle respects a minimum vertex angle and 
a minimum edge length. A tiling is considered effective with respect to a particular 
sensing device if the field of view of the sensing device is guaranteed to include at least 
25 one complete tag at any position of the sensing device on the surface. 

The tiling procedure starts by placing a tag at each vertex of the mesh, so the 
minimum edge length is the same as the tag diameter k. The tiling procedure proceeds by 
inserting a tag at the midpoint of any edge whose length exceeds a maximum tag 
separation s. As illustrated in Figure 9, the maximum tag spacing s is calculated so that if 
30 two adjacent tags 4a and 4b are a distance s + e apart, then there is room for another tag 
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4c between them, i.e. EQ 8. 

However, if the vertex angle between two edges of length s + s is less than 60°, 
then the inserted tags will overlap. 

To prevent inserted tags from overlapping, a minimum tag separation t is 
5 introduced, where t>k. The minimum vertex angle a then becomes a function of k and t, 
as shown in EQ 9, 

Clearly, when t = k, P is constrained to be 60°, i.e. the mesh is constrained to be 
equilateral. But as illustrated in Figure 56, when t > k 9 P can be less than 60° without 
inserted tags overlapping. 

10 The maximum tag separation s must be based on the new minimum tag 

separation t, in accordance with EQ 10. 

When considering a particular mesh triangle, there are four distinct tag 
insertion scenarios. By assuming that the minimum vertex angle is no less than 30° (i.e. 
half of 60°), it can be shown that whenever a mesh triangle has at least one edge less 
15 than or equal to s in length, the remaining two edges are less than 2s in length. In 
practice the minimum vertex angle is typically at least 45°. 

In the first scenario (Figure 57) no edges of a triangle 546 exceed s in length, so 
the tagging of the triangle is already complete. 

In the second scenario (Figure 58) one edge 548 of a triangle 550 exceeds s in 
20 length. A tag 552 is inserted at the midpoint of the edge 548 to complete the tagging of 
the triangle 550. 

In the third scenario (Figure 59) two edges 554, 556 of a triangle 558 exceed s 
in length. Tags 560, 562 are inserted at the midpoint of each of the two long edges 554, 
556 and this may complete the tagging of the triangle 558. Centers of the two inserted 
25 tags 560, 562 together with the two vertices 564, 566 of the short edge 568 of the 
original triangle 558 form a trapezoid. If either diagonal of the trapezoid exceeds s in 
length then a final tag 570 is inserted at the center of the trapezoid to complete the 
tagging of the triangle. 

In the fourth scenario (Figure 60) all three edges 572 of a triangle 573 exceed s 



NPS028US 




-88- 

in length. A tagged vertex 574 is inserted at the midpoint of each edge 572 and the three 
new vertices 574 are joined by edges 576. The tagging procedure is then recursively 
applied to each of the four resultant triangles 577, 578, 579 and 580. Note that the new 
triangles respect the minimum vertex angle because they have the same shape as the 
5 original triangle 573. 

The tag tiling variables are summarized in Table 4. 

Table 4. Tag tiling variables 



variable 


Meaning 


P 


minimum vertex angle 


k 


tag diameter 


m 


minimum diameter of sensor field of view on surface 


s 


maximum center-to-center tag spacing 


t 


minimum center-to-center tag spacing 



10 8.2 Tag Sensing 

8.2.1 Pen Orientation 

To allow a pen-like sensing device to be used as a comfortable writing 

instrument, a range of pen orientations must be supported. Since the pen nib is 
constrained to be in contact with the surface, the orientation of the pen can be 
1 5 characterized by the yaw (z rotation), pitch (x rotation) and roll (y rotation) of the pen, as 
illustrated in Figure 61 . While the yaw of the pen must be unconstrained, it is reasonable 
to constrain the pitch and roll of the pen as well as the overall tilt of the pen resulting 
from the combination of pitch and roll. 

Yaw is conventionally applied after pitch, such that, for example, in the case of 
20 a pen device it would define a twist about the physical axis rather than a direction in the 
surface plane. In a pen with a marking nib, however, the image sensor is mounted off the 
axis of the pen and the pen's image sensing ability (and hence its yaw sensing ability) is 
therefore constrained unless the pen is held almost vertically, as discussed below. Yaw is 
therefore applied before pitch, allowing the full yaw range to be specified by rotating the 
25 pen relative to the surface while keeping pitch and roll constant. 
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Pitch and roll are conventionally defined as y and x rotations, respectively. 
Here they are defined as x and y rotations, respectively, because they are defined with 
respect to the x-y coordinate system of the surface, where the y axis is the natural 
longitudinal axis and the x axis is the natural lateral axis when viewed by a user. In a 
5 right-handed 3D coordinate system, roll is conventionally defined as positive when 
anticlockwise, while pitch and yaw are conventionally defined as positive when 
clockwise. Here all rotations are defined as positive when anticlockwise. 

The pen's overall tilt (0) is related to its pitch (<(>) and the roll (vj/) in accordance 
with EQ 11. 

10 The pen's tilt affects the scale at which surface features are imaged at different 

points in the field of view, and therefore affects the resolution of the image sensor. Since 
it is impractical to sense the area directly under the pen nib, the pen's tilt also affects the 
distance from the nib to the center of the imaged area. This distance must be known to 
allow a precise nib position to be derived from the position determined from the tag. 

15 8.2.2 Image Sensing 

The field of view can be modeled as a cone defined by a solid half-angle a 

(giving an angular field of view of 2a), and an apex height of D above the surface when 
the optical axis is vertical. Although the image sensor is typically rectangular, only the 
largest elliptical subarea of the image sensor is relevant to guaranteeing that a 
20 sufficiently large part of the surface is imaged, as quantified earlier. 

The intersection of the field of view cone with the surface defines an elliptical 
window on the surface. This window is circular when the optical axis is vertical. 

Figure 62 illustrates the geometric relationship, for a given pitch-related tilt 0 
of the pen's optical axis, between the pen's nib (point A), the pen's optical axis (CE), 
25 and the field of view window (FH). The tilt is defined to be clockwise positive from the 
vertical. The equations which follow apply to both positive and negative tilt. 

When the pen is not tilted, the window diameter (i.e. | BD | ) is given by EQ 

12. 

If, when the pen is not tilted, the distance from the nib to edge of the window 
30 (i.e. | AB | ) is T 9 then the distance S from the nib to the center of the window (i.e. 
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I AC | ) is given by EQ 13. 

When the pen is tilted by 0, the distance from the viewpoint to the surface 
along the optical axis is reduced to d (i.e. I GE I ), given by EQ 14. 

The width of the window (i.e. | FH | ) is then given by EQ 15. 

5 D and a must be chosen so that an adequately large area is imaged throughout 

the supported tilt range. The required minimum diameter m of the area is given by EQ 4, 
while the width of the actual imaged area is given by EQ 15. This then gives EQ 16. 

Once D and a are determined, an image sensor resolution must be chosen so 
that the imaged area is adequately sampled, i.e. that the maximum feature frequency is 
; = 10 sampled at its Nyquist rate or higher. 

i y When imaged, the scale of the surface decreases with increasing distance from 

^ the viewpoint and with increasing inclination relative to the viewing ray. Both factors 

l!3 have maximum effect at point F for positive tilt and point H for negative tilt, i.e. at the 

point in the window furthest from the viewpoint. Note that references to F in the 
I : " 15 following discussion apply to H when the tilt is negative. 

| ~ The distance of point F from the viewpoint (i.e. | EF | ) is given by EQ 17. 

= 5 Scaling due to the inclination of the surface relative to the viewing ray through 

. F (EF) is given by EQ 18. 

If the surface feature frequency is/ then the angular surface feature frequency 
20 co at F (i.e. with respect to the field of view) due to both factors is given by EQ 19. 

When there is no object plane tilt (i.e. 9 = 0), this reduces to EQ 20. 

The image sensor is, by definition, required to image at least the entire angular 
field of view. Since the pixel density of the image sensor is uniform, it must image the 
entire field of view at maximum frequency. Given an angular field of view in image 
25 space of 2ct\ an image sensor tilt (i.e. image plane tilt) with respect to the optical axis of 
Q\ and a sampling rate of n (where n > 2 according to Nyquist' s theorem), the minimum 
image sensor resolution q is given by EQ 21 and EQ 22. 

The cos-squared term in the numerator in EQ 22 results from the same 
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reasoning as the cos-squared term in the denominator in EQ 19. 

When there is no image plane tilt (i.e. 0' = 0), and the image space and object 
space angular fields of view are equal (i.e. a' = a), this reduces to EQ 23 and EQ 24. 

When there is no object plane tilt (i.e. 0 = 0) this reduces further to EQ 25. 

5 When the image plane tilt and the object plane tilt are equal (i.e. 0' = 0), and 

the image space and object space angular fields of view are equal (i.e. a' = a), EQ 22 
reduces to EQ 26. 

Matching the image plane tilt to the object plane tilt therefore yields a smaller 
required image sensor size than when the image sensor tilt is fixed at zero, and 
1 0 eliminates perspective distortion from the captured image. Variable image sensor tilt is, 
however, a relatively cotly option in practice, and also requires greater depth of field. 

Figure 63 illustrates the geometric relationship, for a given roll-related tilt 0 of 
the pen's optical axis, between the pen's nib (point A), the pen's optical axis (CE), and 
the field of view window (FH). The tilt is again defined to be clockwise positive from 
1 5 the vertical. With the exception of EQ 13, the preceding equations apply equally to roll- 
induced tilt. For roll-induced tilt the distance S from the nib to the center of the window 
(i.e. | AC | ) is zero rather than as defined by EQ 13. 

For pitch-induced tilt, the magnitude of the tilt range is maximised by choosing 
a minimum (negative) tilt and a maximum (positive) tilt which have the same image 
20 sensor requirement. Since, for pitch-induced tilt, the surface is more distant for negative 
tilt than for positive tilt of the same magnitude, the minimum has a smaller magnitude 
than the maximum. For roll-induced tilt they have the same magnitude. 

As described above, the smallest features of the tag 4 are the structures which 
encode the data bits, and these have a minimum diameter of 8 dots. This gives a 
25 maximum feature frequency / of about 7.9 per mm at 1600 dpi. 

As calculated according to EQ 4 above, an equilateral triangular tiling of 256- 
dot diameter tags with no overlap between successive lines of tags requires a minimum 
field of view window diameter on the surface of 598 dots, or about 9.5 mm at 1600 dpi. 

Most people hold a pen at about +30° pitch and 0° roll. The inking ball of a 
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ball-point nib loses effective contact with the surface beyond about +50° pitch (i.e. 40° 
from the horizontal). A reasonable target pitch range is therefore -10° to +50°, and a 
reasonable roll range -30° to +30°, bearing in mind greater limitations on combined 
pitch and roll as given by EQ 11. 

5 The highly compact (1.5 mm 2 ) Matsushita CCD image sensor (Matsushita 

Electronic Corporation, and is described in a paper by Itakura, K T Nobusada, N 
Okusenya, R Nagayoshi, and M Ozaki, "A 1mm 50k-Pixel IT CCD Image Sensor for 
Miniature Camera System", IEEE Transactions on Electronic Devices, Volt 47, number 
1 , January 2000) is suitable for use in a compact device such as a pen. It has an available 
10 resolution of approximately 215x215 pixels. Assuming equal image and object space 
1 ^ angular fields of view, no image plane tilt, and a nib-to-window distance T of 4 mm, 

=1 optimizing the geometry using EQ 16 and EQ 24 to achieve the desired pitch and roll 

| y . ranges stated above yields a pitch range of -16° to +48° (64°) and a roll range of -28° to 

+28° (56°) with a viewing distance D of 30 mm and an angular field of view of 1 8.8° (a 
, 15 = 9.4°). The available pitch range is actually -21° to +43°, and this is mapped to close to 
H the desired range by pitching the optical axis at -5° relative to the physical axis. Note 

I y that the tilt range can be expanded slightly by optimizing a non-zero tilt of the image 

^2 plane. 

The overall pen tilt is thus confined to an elliptical cone whose major angle is 
20 64° in the pitch plane and whose minor angle is 56° in the roll plane. 

The image sensing variables are summarized in Table 5. 

Table 5. Image sensing variables 



variable 


meaning 


a 


Object space field of view half-angle 


a' 


Image space field of view half-angle 


Y 


Pen yaw 


e 


Object plane tilt (i.e. overall pen tilt) 


e' 


Image plane tilt 




Pen pitch 


w 


Pen roll 
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CO 


A\l 1 U Id I 1 1 d|UC7I l^y III 1 IC7ILJ Ul V ICVV 


D 


Normal viewing distance 


a 


Tilted viewing distance 


f 


Surface feature frequency 


n 


Sampling rate 


q 


Image sensor resolution 


s 


Distance from nib to center of field of view on 
surface (when 0 = 0) 




Distance from nib to edge of field of view on 


T 


surface (when 0 = 0) 



8.3 Tag Decoding 

8.3.1 Tag Image Processing and Decoding 

Tag image processing is described earlier in Section 1.2.4. It culminates in 

5 knowledge of the 2D perspective transform on the tag, as well as the decoded tag data. 

8.3.2 Inferring the Pen Transform 

Once the 2D perspective transform is obtained which accounts for the 

perspective distortion of the tag in the captured image, as described earlier, the 
corresponding discrete 3D tag transform with respect to the pen's optical axis can be 
1 0 inferred, as described below in Section 8.4. 

Once the discrete 3D tag transform is known, the corresponding 3D pen 
transform can be inferred, i.e. the transform of the pen's physical axis with respect to the 
surface. The pen's physical axis is the axis which is embodied in the pen's shape and 
which is experienced by the pen's user. It passes through the nib. The relationship 
1 5 between the physical axis and the optical axis is illustrated in Figure 64. 

It is convenient to define three coordinate spaces. In sensor space the optical 
axis coincides with the z axis and the the viewpoint is at the origin. In pen space the 
physical axis coincides with the z axis and the nib is at the origin. In tag space the tag 4 
lies in the x-y plane with its center at the origin. The tag transform transforms the tag 4 
20 from tag space to sensor space. 
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Sensor space is illustrated in Figure 64. The labelling of points in Figure 64 is 
consistent with the labelling in Figure 62. The viewpoint is at E, the sensed point is at G, 
and the nib is at A. The intersection point G between the optical axis and the surface is 
referred to as the sensed point. In contrast with the geometry illustrated in Figure 62 
5 where the nib is considered as a point, here the nib is considered as a small sphere. If the 
nib is curved, then the tilt of the physical axis affects the offset between the sensed point 
and the contact point between the nib and the surface. The center point K of the spherical 
nib, about which the physical axis pivots, is referred to as the pivot point. 

The nib makes nominal contact with the surface at point A when the optical 
10 axis is vertical. KA is defined to be parallel to the optical axis. When the pen is tilted, 
however, contact is at point L, as shown in Figure 65. Given the radius R of the nib, the 
distance of the pivot point K from the surface, e.g. at A or L, is always R. 

The discrete tag transform includes the translation of the tag center from the 
sensed point, the 3D tag rotation, and the translation of the sensed point from the 
15 viewpoint. 

Given the translation d of the sensed point from the viewpoint in the discrete 
tag transform, and according to EQ 14, the sensed point is given by EQ 27. 

Since the physical axis only differs from the optical axis by a y translation and 
x rotation (i.e. pitch), the physical axis lies in the y-z plane. With reference to Figure 64, 
20 where | AC I = S and | EC I = D (just as in Figure 62), it is clear that in sensor space the 
position of the pivot point is given by EQ 28. 

The vector from the sensed point to the pivot point is therefore given by EQ 

29. 

The vector from the pivot point to the contact point is by definition a surface 
25 normal of length R. It is constructed by applying the 3D tag rotation M to a tag space 
surface normal, normalizing the result, and scaling by R, as shown in EQ 30 and EQ 31. 

The vector from the sensed point to the contact point is then obtained in 
accordance with EQ 32. 

This is transformed into tag space by applying the inverse of the tag transform 
30 3D rotation, and is then added to the vector from the tag center to the sensed point, to 
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yield the vector from the tag center to the contact point in tag space, i.e. on the surface, 
in accordance with EQ 33. 

This is finally added to the tag's absolute location, as implied by its tag ID, to 
yield the nib's desired absolute location in the tagged region: see EQ 34. 

5 The final step is to infer the pen's 3D orientation from the tag's 3D orientation. 

The pen's discrete rotations are simply the inverses of the tag's discrete rotations, with 
the pen's pitch also including the effect of the pitch (§ S ensor) of the optical axis with 
respect to the pen's axis, as defined in EQ 35, EQ 36 and EQ 37. 

8.4 Inferring the Tag Transform 

10 The image of the tag 4 captured by the image sensor contains perspective 

distortion due to the position and orientation of the image sensor with respect to the tag. 
Once the perspective targets of the tag are found in image space, an eight-degree-of- 
freedom perspective transform is inferred based on solving the well-understood 
equations relating the four tag-space and image-space point pairs. The discrete transform 
1 5 steps which give rise to the image of the tag are concatenated symbolically, and a set of 
simultaneous non-linear equations is obtained by equating corresponding terms in the 
concatenated transform and the perspective transform. Solving these equations yields the 
discrete transform steps, which include the desired tag offset from the nib, 3D tag 
rotation, and viewpoint offset from the surface. 

20 8.4.1 Modeling the Tag Transform 

The transform of the tag 4 from tag space to image space can be modeled as a 

concatenation of the following transform steps: 

• x-y translate (by tag-to-viewpoint offset) 

• z rotate (by tag yaw) 
25 • x rotate (by tag pitch) 

• y rotate (by tag roll) 

• z translate (by tag-to-viewpoint offset) 

• perspective project (with specified focal length) 
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• x-y scale (to viewport size) 

These are concatenated symbolically to produce a single transform matrix 
which effects the tag transform. Table 7 summarizes the discrete transform variables 
used in the following sections, together with the range of each variable. 



5 

Table 7. Discrete transform variables and their ranges 



Variable 


Abbrev. 


Meaning 


Unit 

Licl Flo 1 (J II 11 


Range 


Y 


- 


yaw 


0 


0 < y < 2ti 


♦ 




pitch 


0 


-7t/2 < <|>< 7l/2 






roll 


0 


-Tt/4 < V|/ < tc/4. 


fx 


A 


tag-to-viewpoint x offset 


0 




ty 


B 


tag-to-viewpoint y offset 


0 




cosy 


C 


cosine of yaw 


1 


-1 < C < 1 


siny 


D 


sine of yaw 


0 


-1 < D < 1 


COS(|> 


E 


cosine of pitch 


1 


0<E<1 


sin<)> 


F 


sine of pitch 


0 


-1 < F < 1 


cosy 


G 


cosine of roll 


1 


0< G< 1 


sinvj/ 


H 


sine of roll 


0 


-1 < H < 1 


tz 


1 


tag-to-viewpoint z offset 




/<0 


MX 


J 


inverse focal length 




J>0 


S 




viewport scale 




S>0 



Translate in x-y plane by t x and t y according to EQ 42 (where A=t x and &=t y ). 

1 0 Rotate about z by y according to EQ 43 (where C=cos(y) and Z)=sin(y)), giving 

EQ 44. 

Rotate about x by <(> according to EQ 45 (where £=cos(<j>) and F=sin(<}>)), giving 

EQ 46. 

Rotate about y by \|/ according to EQ 47 (where G=cos(\|/) and i/=sin(\}/)), 
1 5 giving EQ 48, where K and L are defined by EQ 49 and EQ 50. 

Translate in z by t 2 according to EQ 51 (where I=t z ), giving EQ 52. 
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Perspective project with focal length X and projection plane at z=0 according to 
EQ 53 (where J=l/k), giving EQ 54. 

Scale to viewport by S according to EQ 55, giving EQ 56. 

Transform a point in the x-y plane (z=0) according to EQ 57, giving EQ 58. 

5 Finally, expand K and L, giving EQ 59. 

8.4.2 2D Perspective Transform 

Given an inferred eight-degree-of-freedom 2D perspective transform matrix as 

defined in EQ 60, multiply by an unknown i to obtain the general nine-degree-of- 
freedom form of the matrix, as shown in EQ 61. 

1 0 Transform a 2D point according to EQ 62, giving EQ 63. 

8.4.3 Inferring the Tag Transform 

8.4.3.1 Equating Coefficients 

Equating the coefficients in EQ 59 with the coefficients in EQ 63 results in 

EQ 64 to EQ 72, being nine non-linear equations in 1 1 unknowns. 

15 These equations are augmented as required by the trigonometric identity 

relating the sine and cosine of an angle (i.e. the sine and cosine of any one of yaw, pitch 
and roll), as shown in EQ 73. 

Given the sine and cosine of an angle, the corresponding angle is obtained 
using a two-argument arctan as shown in EQ 74. 

20 

8.4.3.2 Solving for X-Y Offset 

EQ 66 can be simplified using EQ 64 and EQ 65 to give EQ 75 and then EQ 

76. 

EQ 69 can be simplified using EQ 67 and EQ 68 to give EQ 77 and then EQ 

25 78. 

EQ 72 can be simplified using EQ 70 and EQ 71 to give EQ 79 and then EQ 

80. 
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EQ 76 can be re-written as EQ 81 5 and EQ 78 can be re-written as EQ 82. 

Equating EQ 81 and EQ 82 and solving for 8 yields EQ 83 through EQ 85 and 
finally EQ 86, which defines B. 

Substituting the value for B into EQ 82 and simplifying yields EQ 87 through EQ 
5 90 and finally EQ 91 , which defines A. 

This therefore gives the x-y offset of the tag 4 from the viewpoint, since A=t x 

and B=t y . 

8.4.3.3 Solvinq for Pitch 

92 can be obtained. 

93 can be obtained. 

92 and EQ 93, EQ 94 can be obtained. 
92 and EQ 93, EQ 95 can be obtained. 
92 and EQ 93, EQ 96 can be obtained. 
92 and EQ 93, EQ 97 can be obtained. 

98 can be obtained. 

99 can be obtained. 

100 can be obtained. 

101 can be obtained. 

From EQ 98 and EQ 99, EQ 102 and then EQ 103 can be obtained. 

20 From EQ 100 and EQ 101, EQ 104 and then EQ 105 can be obtained. 

From EQ 103 and EQ 105, EQ 106 and then EQ 107 can be obtained. 

EQ 107 only has a valid basis if G and if are both non-zero. Since |v|/| < 7t/2, the 
cosine (G) of the roll is always positive and hence non-zero. The sine of the roll (H) is 
only non-zero if the roll is non-zero. Specific handling for zero pitch and roll is 
25 described in Section 6.7.3.10. 

This therefore gives the magnitude of the sine of the pitch, since F=sin(<|>), and 
hence the cosine (E) of the pitch by EQ 73, according to EQ 108. 







From EQ 68, EQ 




10 


From EQ 67, EQ 






From EQ 64, EQ 


■r, '■ : t 




From EQ 65, EQ 


%M 




From EQ 70, EQ 
From EQ 71, EQ 




15 


From EQ 94, EQ 


In 




From EQ 95, EQ 


■3. _i 




From EQ 96, EQ 
From EQ 97, EQ 
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Since |<|>| < n/2, the cosine (E) of the pitch is always positive, so there is no 
ambiguity when taking the square root. The sign of the sine (i 7 ), however, must be 
determined by other means, as described in Section 6.7.3.9. 

Given E and F, the pitch is then obtained, according to EQ 109. 

5 8.4.3.4 Solving for Roll 

From EQ 103, EQ 110 can be obtained. 

From EQ 73, EQ 111 and then EQ 112 can be obtained. 

This therefore gives the magnitude of the sine of the roll, since 7/=sin(\|/), and 
hence the cosine (G) of the roll by EQ 73, according to EQ 113. 

10 Since |\|/| < 7i/4, the cosine (G) of the roll is always positive, so there is no 

ambiguity when taking the square root. The sign of the sine (H) 9 however, must be 
determined by other means, as described in Section 6.7.3.9. 

Given G and H, the roll is then obtained according to EQ 114. 

8.4.3.5 Solving for Yaw 

1 5 From EQ 73, EQ 92 and EQ 93, EQ 115 and then EQ 116 can be obtained. 

From EQ 92 and EQ 116, EQ 117 and then EQ 118 can be obtained. 

From EQ 92 and EQ 116, EQ 119 and then EQ 120 can be obtained. 

In EQ 116, and hence EQ 118 and EQ 120, the sign of the square root is 
determined by the sign of /, which can be determined from EQ 80, giving EQ 121. 

20 Since / (t z ) is negative, J is positive, and IJ < -1 (because \t 2 \ > A,), then 

EQ 122 holds. 

Given C and D 9 the yaw is then obtained according to EQ 123. 

8.4.3.6 Solving for Viewport Scale 

The cosine (Q and sine (D) of the yaw are by definition never simultaneously 

25 zero. Since the cosine (E) of the pitch is never zero, either EQ 67 or EQ 68 can therefore 
always be used to determine the viewport scale (5). 

If D is non-zero, then from EQ 67, EQ 124 can be obtained. 
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Otherwise, if C is non-zero, then from EQ 68, EQ 125 can be obtained. 

8.4.3.7 Solving for Focal Length 

Similarly, since the cosine (G) of the roll is never zero, either EQ 70 or EQ 71 

can be used to determine the inverse focal length (J), so long as either the pitch or roll is 
non-zero. However, the signs of the sines (F and H) of the pitch and roll may not be 
known. However, the sign of the product (FH) of the sines of the pitch and roll is given 
by EQ 103, as shown in EQ 126. 

The sign can be assigned arbitrarily to F, since the sign of J is known a priori. 
If gi is non-zero, then from EQ 70, EQ 127 can be obtained. 

If hi is non-zero, then from EQ 71, EQ 128 can be obtained. 

In practice, the choice between using EQ 127 and EQ 128 is based on which of 
gi and hi has the larger magnitude. The inverse focal length is unknown if gi and hi are 
both zero, i.e. if the pitch and roll are both zero. 

8.4.3.8 Solving for Z Offset 

Once the inverse focal length (J) is known, the z offset (I) is obtained from EQ 

80, according to EQ 129. 

Again, the z offset (I) is unknown if the inverse focal length (J) is unknown, i.e. 
if the pitch and roll are both zero. 

8.4.3.9 Determining Direction of Pitch and Roll 

The sign of the product (FH) of the sines of the pitch and roll is given by EQ 

126. Since —n/4 < \|/ < n/4, a roll adjustment of +7i/4 can be introduced to ensure the roll 
is always positive, without invalidating any other assumptions. Once the roll adjustment 
is introduced, EQ 126 gives the sign of the sine (F) of the pitch alone. 

The roll adjustment is introduced as follows. The viewport scale (S)> inverse 
focal length (J) 9 and z offset (I) are all computed as described. A 3D transform matrix is 
created from the 2D perspective transform matrix. The inverses of the viewport scale, 
focal length projection and z translation are applied to the 3D matrix in reverse order. 
The roll adjustment is then applied by pre-multiplying the matrix by a n/4 y rotation 
matrix. The roll, pitch and yaw are computed as described. Since the roll is positive, the 
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pitch direction is now known. The 7i/4 roll adjustment is finally subtracted from the roll 
to give the actual roll. 

When the roll and pitch are both zero, the focal length and z offset are both 
unknown as described above. However, in this case there is no need to adjust the roll 
5 since the pitch and roll are already known. 

8.4.3.10 Handling Zero Pitch and Roll 

When either the pitch or roll is zero, the general solution based on EQ 107 

becomes invalid. The table of Figure 85 shows the 12 degenerate forms of EQ 64 
through EQ 71 which result when the yaw is variously zero (or 7t), 7i/2 (or 37t/2), and 
1 0 non-zero, and the pitch and roll are variously zero and non-zero. The table of Figures 86 
and 87 sets out the required logic for detecting and handling cases where the pitch and/or 
roll are zero, with each case motivated by zeros appearing in the table of Figure 85. The 
cases in the table of Figure 85 are labelled with the case numbers from the table of 
Figures 86 and 87. 



15 Conclusion 

The present invention has been described with reference to a preferred 
embodiment and number of specific alternative embodiments. However, it will be 
appreciated by those skilled in the relevant fields that a number of other embodiments, 
differing from those specifically described, will also fall within the spirit and scope of 
20 the present invention. Accordingly, it will be understood that the invention is not 
intended to be limited to the specific embodiments described in the present specification, 
including documents incorporated by cross-reference as appropriate. The scope of the 
invention is only limited by the attached claims. 
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